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Preface 

Read This First 



About This Manual 

This user's guide serves as a reference book for the TMS320C3x generation 
of digital signal processors, which includes the TMS320C30, TMS320C30-27, 
TMS320C30-40, TMS320C31 , TMS320C31 -27, TMS320C31 -40, 
TMS320C31 -50, TMS320LC31 , and TMS320C31 PQA. Throughout the book, 
all references to 'C3x refer collectively to 'C30 and 'C31 , and the TMS320C30 
and TMS320C31 refer to all speed variations unless an exception is noted. 
This document provides information to assist managers and 
hardware/software engineers in application development. 

How to Use This Book 

This revision of the TMS320C3x User's Guide incorporates the following 
changes: 

Q Updated reference list of publications 

□ Improved description of repeat modes and interrupts in Chapter 6 

□ Description of power management modes in Chapter 6 

□ Improved description of serial ports and DMA coprocessor in Chapter 8 

□ Description of power management instructions in Chapter 1 0 

□ Description of low-power-mode interrupt interface in Chapter 12 

□ More detailed information on MPSD emulator interface, signal timings, 
and connections between emulator and target system 

□ Current timing specification in Chapter 1 3 

□ TMS320C30PPM pinout, mechanical drawing, and timings in Chapter 1 3 

□ Development support description and device/tool part numbers in 
Appendix B 

□ Data sheet for current military versions of the 'C3x in Appendix E 
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Notational Conventions 



Notational Conventions 



This document uses the following conventions: 

□ Program listings, program examples, interactive displays, filenames, and 
symbol names are shown in a special font. Examples use a bold version 
of the special font for emphasis. Here is a sample program listing: 



□ In syntax descriptions, the instruction, command, or directive is in a bold 
face font and parameters are in italics. Portions of a syntax that are in 
bold face should be entered as shown; portions of a syntax that are in 
italics describe the type of information that should be entered. Here is an 
example of a directive syntax: 

.asect "section name", address 

.asect is the directive. This directive has two parameters, indicated by 
section name and address. When you use .asect, the first parameter must 
be an actual section name, enclosed in double quotes; the second 
parameter must be an address. 

□ Square brackets ( [ and ] ) identify an optional parameter. If you use an 
optional parameter, you specify the information within the brackets; you 
don't enter the brackets themselves. Here's an example of an instruction 
that has an optional parameter: 

LALK 1 6-bit constant [, shift] 

The LALK instruction has two parameters. The first parameter, 16-bit 
constant, is required. The second parameter, shift, is optional. As this 
syntax shows, if you use the optional second parameter, you must 
precede it with a comma. 

Square brackets are also used as part of the pathname specification for 
VMS pathnames; in this case, the brackets are actually part of the 
pathname (they are not optional). 

□ Braces ( { and } ) indicate a list. The symbol | (read as ot) separates items 
within the list. Here's an example of a list: 



This provides three choices: *, *+, or *-. 

Unless the list is enclosed in square brackets, you must choose one item 
from the list. 



0011 0005 0001 

0012 0005 0003 

0013 0005 0006 

0014 0006 



.field 
.field 
.field 



If 2 
3, 4 
6, 3 



• even 



{*|*+|*-} 
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□ Some directives can have a varying number of parameters. For example, 
the .byte directive can have up to 1 00 parameters. The syntax for this 
directive is 

■byte valuei [,..., value n ] 

This syntax shows that .byte must have at least one value parameter, but 
you have the option of supplying additional value parameters separated 
by commas. 



Information About Cautions 



This book may contain cautions and warnings. 

□ A caution describes a situation that could potentially cause your system 
to behave unexpectedly. 



This is what a caution looks like. 



The information in a caution is provided for your information. Please read each 
caution carefully. 



Related Documentation From Texas Instruments 

The following books describe the TMS320 floating-point devices and related 
support tools. To obtain a copy of any of these Tl documents, call the Texas 
Instruments Literature Response Center at (800) 477-8924. When ordering, 
please identify the book by its title and literature number. 

TMS320 Floating-Point DSP Assembly Language Tools User's Guide 

(literature number SPRU035) describes the assembly language tools 
(assembler, linker, and other tools used to develop assembly language 
code), assembler directives, macros, common object file format, and 
symbolic debugging directives for the 'C3x and 'C4x generations of 
devices. 

TMS320 Floating-Point DSP Optimizing C Compiler User's Guide 

(literature number SPRU034) describes the TMS320 floating-point C 
compiler. This C compiler accepts ANSI standard C source code and 
produces TMS320 assembly language source code for the 'C3x and 
'C4x generations of devices. 
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Related Documentation from Texas Instruments / References 



TMS320C3x C Source Debugger (literature number SPRU053) describes 
the 'C3x debugger for the emulator, evaluation module, and simulator. 
This book discusses various aspects of the debugger interface, including 
window management, command entry, code execution, data 
management, and breakpoints. It also includes a tutorial that introduces 
basic debugger functionality. 

TMS320 Family Development Support Reference Guide (literature number 
SPRU011) describes the '320 family of digital signal processors and the 
various products that support it. This includes code-generation tools 
(compilers, assemblers, linkers, etc.) and system integration and debug 
tools (simulators, emulators, evaluation modules, etc.). This book also 
lists related documentation, outlines seminars and the university 
program, and provides factory repair and exchange information. 

TMS320 Third-Party Support Reference Guide (literature number 
SPRU052) alphabetically lists over 1 00 third parties who supply various 
products that serve the family of '320 digital signal processors, including 
software and hardware development tools, speech recognition, image 
processing, noise cancellation, modems, etc. 



References 

The publications in the following reference list contain useful information 
regarding functions, operations, and applications of digital signal processing 
(DSP). These books also provide other references to many useful technical 
papers. The reference list is organized into categories of general DSP, speech, 
image processing, and digital control theory and is alphabetized by author. 

□ General Digital Signal Processing: 

Antoniou, Andreas, Digital Filters: Analysis and Design. New York, NY: 
McGraw-Hill Company, Inc., 1979. 

Bateman, A., and Yates, W., Digital Signal Processing Design. Salt Lake 
City, Utah: W. H. Freeman and Company, 1990. 

Brigham, E. Oran, The Fast Fourier Transform. Englewood Cliffs, NJ: 
Prentice-Hall, Inc., 1974. 

Burrus, C.S., and Parks, T.W., DFT/FFT and Convolution Algorithms. New 
York, NY: John Wiley and Sons, Inc., 1984. 

Chassaing, R., and Horning, D., Digital Signal Processing with the 
TMS320C25. New York, NY: John Wiley and Sons, Inc., 1990. 

Digital Signal Processing Applications with the TMS320 Family Vol. I. 
Texas Instruments, 1986; Prentice-Hall, Inc., 1987. 
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Order Texas Instruments 
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operation or report suspected 


(713)274-2320 


problems 


FAX: (713) 274-2324 
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European fax line: +33-1-3070-1032 



Report mistakes in this document Fill out and return the reader response card at 
or any other Tl documentation the end of this book, or send your comments to: 

Texas Instruments Incorporated 
Technical Publications Manager, MS 702 
RO. Box 1443 

Houston, Texas 77251 -1 443 
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Chapter 1 



Introduction 



The TMS320C3x generation of digital signal processors (DSPs) are high-per- 
formance CMOS 32-bit floating-point devices in the TMS320 family of 
single-chip digital signal processors. Since 1 982, when the TMS3201 0 was in- 
troduced, the TMS320 family, with its powerful instruction sets, high-speed 
number-crunching capabilities, and innovative architectures, has established 
itself as the industry standard. It is ideal for DSP applications. 

The 40-ns cycle time of the TMS320C31 -50 allows it to execute operations at 
a performance rate of up to 60 million floating-point instructions per second 
(MFLOPS) and 30 million instructions per second (MIPS). This performance 
was previously available only on a supercomputer. The generation's perform- 
ance is further enhanced through its large on-chip memories, concurrent direct 
memory access (DMA) controller, and two external interface ports. 

This chapter presents the following major topics: 



Topic Page 

1.1 General Description 1-2 

1.2 TMS320C30 Key Features 1*6 

1.3 TMS320C31 KeyFeatures 1-8 

1.4 Typical Applications 1-10 
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General Description 



1.1 General Description 



The TMS320 family consists of five generations: TMS320C1x, TMS320C2x, 
TMS320C3x, TMS320C4x, and TMS320C5x (see Figure 1-1). The expan- 
sion includes enhancements of earlier generations and more powerful new 
generations of DSPs. 

The TMS320's internal busing and special DSP instruction set have the speed 
and flexibility to execute at up to 50 MFLOPS. The TMS320 family optimizes 
speed by implementing functions in hardware that other processors imple- 
ment through software or microcode. This hardware-intensive approach pro- 
vides power previously unavailable on a single chip. 

The emphasis on total system cost has resulted in a less expensive processor 
that can be designed into systems currently using costly bit-slice processors. 
Also, cost/performance selection is provided by the different processors in the 
TMS320C3x generation: 

□ TMS320C30: 60-ns, single-cycle execution-time 



□ TMS320C30-27: 

□ TMS320C30-40 

□ TMS320C30-50 



Lower cost; 74-ns, single-cycle execution time 
Higher speed; 50-ns, single-cycle execution time 
Highest speed; 40-ns, single-cycle execution time 

□ TMS320C31 : Low cost; 60-ns, single-cycle execution time 

□ TMS320C31 -27: Lower cost; 74-ns, single-cycle execution time 

□ TMS320C31 -40: Low cost; 50-ns, single-cycle execution time 

□ TMS320C31 PQA: Low cost; extended temperature; 60-ns, single-cycle 

execution time 

□ TMS320C31 -50: Highest speed; 40-ns, single-cycle execution time 

□ TMS320LC31 : Low power; 60-ns, single-cycle execution time, 

3.3-volt operation 

All of these processors are described in this user's guide. Essentially, their 
functionality is the same. However, electrical and timing characteristics vary 
(as described in Chapter 13); part numbering information is found in Section 
B.2 on page B-7. Throughout this book, TMS320C3x is used to refer to the 
TMS320C30 and TMS320C31 and all speed variations. TMS320C30 and 
TMS320C31 are used to refer to all speed variants of those processors where 
appropriate. Special references, such as TMS320C30-40, are used to note 
specific exceptions. 
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General Description 



Figure 1-1. TMS320 Device Evolution 
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The TMS320C30 and TMS320C31 can perform parallel multiply and arithme- 
tic logic unit (ALU) operations on integer or floating-point data in a single cycle. 
The processor also possesses a general-purpose register file, a program 
cache, dedicated auxiliary register arithmetic units (ARAU), internal dual-ac- 
cess memories, one DMA channel supporting concurrent I/O, and a short ma- 
chine-cycle time. High performance and ease of use are products of those fea- 
tures. 

General-purpose applications are greatly enhanced by the large address 
space, multiprocessor interface, internally and externally generated wait 
states, two external interface ports (one on the TMS320C31), two timers, two 
serial ports (one on the TMS320C31), and multiple interrupt structure. The 
TMS320C3x supports a wide variety of system applications from host proces- 
sor to dedicated coprocessor. 

High-level language is more easily implemented through a register-based ar- 
chitecture, large address space, powerful addressing modes, flexible instruc- 
tion set, and well-supported floating-point arithmetic. 



General Description 



Figure 1-2 is a functional block diagram that shows the interrelationships be- 
tween the various TMS320C3x key components. 



Figure 1-2. TMS320C3x Block Diagram 
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TMS320C30 Key Features 



1.2 TMS320C30 Key Features 

Some key features of the TMS320C30 are listed below. 

□ Performance 

■ TMS320C30 (33 MHz) 

■ 60-ns, single-cycle instruction execution time 

■ 33.3 MFLOPS 

■ 16.7 MIPS 

■ TMS320C30-27 

■ 74-ns, single-cycle instruction execution time 

■ 27 MFLOPS 

■ 13.5 MIPS 

■ TMS320C30-40 

■ 50-ns, single-cycle instruction execution time 

■ 40 MFLOPS 

■ 20 MIPS 

□ One 4K x 32-bit, single-cycle, dual-access, on-chip, read-only memory 
(ROM) block 

□ Two 1K x 32-bit, single-cycle, dual-access, on-chip, random access 
memory (RAM) blocks 

□ 64- x 32-bit instruction cache 

□ 32-bit instruction and data words 

□ 24-bit addresses 

□ 40-/32-bit floating-point/integer multiplier and ALU 

□ 32-bit barrel shifter 

□ Eight extended-precision registers (accumulators) 

□ Two address generators with eight auxiliary registers and two auxiliary 
register arithmetic units 

□ On-chip DMA controller for concurrent I/O and CPU operation 

□ Integer, floating-point, and logical operations 

□ Two- and three-operand instructions 

□ Parallel ALU and multiplier instructions in a single cycle 
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TMS320C30 Key Features 



□ Block repeat capability 

□ Zero-overhead loops with single-cycle branches 

□ Conditional calls and returns 

□ Interlocked instructions for multiprocessing support 

□ Two 32-bit data buses (24- and 1 3-bit address) 

□ Two serial ports to support 8/1 6/24/32-bit transfers 

□ Two 32-bit timers 

□ Two general-purpose external flags; four external interrupts 

□ 1 81 -pin grid array (PGA) package; 1 -urn CMOS 
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TMS320C31 Key Features 



1.3 TMS320C31 Key Features 

The TMS320C31 is a low-cost 32-bit DSP that offers the advantages of a floa- 
ting-point processor and ease of use. The TMS320C31 devices are object- 
code compatible with the TMS320C30. Aside from lacking a ROM block and 
having a single serial port, the TMS320C31 is functionally equivalent to the 
TMS320C30 but differs in its respective electrical and timing characteristics. 
Chapter 13 describes these differences in detail. 

□ The TMS320C31 (33 MHz) features are identical to those of the 
TMS320C30 device, except that the TMS320C31 uses a subset of the 
TMS320C30's standard peripheral and memory interfaces. This main- 
tains the 33-MFLOPS performance of the TMS320C30's core CPU while 
providing the cost advantages associated with 1 32-pin plastic quad flat 
pack (PQFP) packaging. 

□ The TMS320C31 -27 is the slower speed version of the TMS320C31 . The 
TMS320C31-27 delivers 27 MFLOPS and runs at 27 MHz. The reduced 
speed allows you to realize an immediate system cost reduction by using 
slower off-chip memories and a lower-cost processor. 

□ The TMS320C31-40 is a high-speed version of the TMS320C31. The 
40-MHz TMS320C31-40 runs with 50-ns cycle time and offers up to 40 
MFLOPS in performance. 

□ The TMS320C31 -50 is the highest-speed version of the TMS320C31 . The 
50-MHz TMS320C31-50 runs with 40-ns cycle time and offers up to 50 
MFLOPS in performance. 

□ The TMS320C31 PQA (33 MHz) offers extended-temperature capabilities 
to TMS320C31 performance. The TMS320C31 PQA will operate at case 
temperatures ranging from -40° C to +85° C, making it a lower-cost floa- 
ting-point solution for industrial and extended-temperature commercial 
applications. 

□ The TMS320LC31 is the low-power version of the TMS320C31 . The 
TMS320LC31 runs with 60-ns cycle time and offers up to 33 MFLOPS in 
performance at 3.3-volt operation. 

Some key features of the TMS320C31 , including those which differentiate it 
from the TMS320C30, are summarized as follows: 

□ Performance 

■ TMS320C31 (PQUPQA) 

■ 60-ns, single-cycle instruction execution time 

■ 33.3 MFLOPS 

■ 1 6.7 MIPS (million instructions per second) 
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TMS320C31 Key Features 



■ TMS320C31-27 

■ 74-ns, single-cycle instruction execution time 

■ 27 MFLOPS 

■ 13.5 MIPS 

■ TMS320C31-40 

■ 50-ns, single-cycle instruction execution time 

■ 40 MFLOPS 

■ 20 MIPS 

■ TMS320C31-50 

■ 40-ns, single-cycle instruction execution time 

■ 50 MFLOPS 

■ 25 MIPS 

■ TMS320LC31 

■ 60-ns, single-cycle instruction execution time 

■ 33.3 MFLOPS 

■ 16.7 MIPS 

■ Low-power, 3.3 volt operation 

■ Two power-down nodes; 2-MHz operation and idle 

□ Flexible boot program loader 

□ One serial port to support 8-/1 6-/24-/32-bit transfers 

□ 132-pin PQFP package, .8 urn CMOS 
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Typical Applications 



1 .4 Typical Applications 

The TMS320 family's versatility, real-time performance, and multiple functions 
offer flexible design approaches in a variety of applications, which are shown 
in Table 1-1. 



Table 1-1. Typical Applications of the TMS320 Family 
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Chapter 2 



TMS320C3x Architecture 



This chapter gives an architectural overview of the TMS320C3x processor. 
Major areas of discussion are listed below. 

Topic Page 

2.1 Architectural Overview . 2-2 

2.2 Central Processing Unit (CPU) 2-4 

2.3 Memory Organization 2-11 

2.4 Instruction Set Summary 2-17 

2.5 Internal Bus Operation 2-22 

2.6 Parallel Instruction Set Summary 2-23 

2.7 External Bus Operation ....................................... 2-26 

2.3 Peripherals 2-27 

2.9 Direct Memory Access (DMA) 2-29 

2.10 TMS320C30 and TMS320C31 Differences 2-30 

2.11 System Integration 2-32 
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Architectural Overview 



2.1 Architectural Overview 

The TMS320C3x architecture responds to system demands that are based on 
sophisticated arithmetic algorithms and that emphasize both hardware and 
software solutions. High performance is achieved through the precision and 
wide dynamic range of the floating-point units, large on-chip memory, a high 
degree of parallelism, and the direct memory access (DMA) controller. 

Figure 2-1 is a block diagram of the TMS320C3x architecture. 
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Architectural Overview 



Figure 2-1. TMS320C3x Block Diagram 
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Central Processing Unit (CPU) 



2.2 Central Processing Unit (CPU) 

The TMS320C3x has a register-based central processing unit (CPU) architec- 
ture. The CPU consists of the following components: 

□ Floating-point/integer multiplier 

□ Arithmetic logic unit (ALU) for performing floating-point, integer, and log- 
ical-operations arithmetic 

□ 32-bit barrel shifter 

□ Internal buses (CPU1/CPU2 and REG1/REG2) 

□ Auxiliary register arithmetic units (ARAUs) 

□ CPU register file 

Figure 2-2 shows the various CPU components that are discussed in the 
succeeding subsections. 
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Central Processing Unit (CPU) 



Figure 2-2. Central Processing Unit (CPU) 
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2.2.1 Multiplier 

The multiplier performs single-cycle multiplications on 24-bit integer and 32-bit 
floating-point values. The TMS320C3x implementation of floating-point arith- 
metic allows for floating-point operations at fixed-point speeds via a 50-ns in- 
struction cycle and a high degree of parallelism. To gain even higher through- 
put, you can use parallel instructions to perform a multiply and ALU operation 
in a single cycle. 

When the multiplier performs floating-point multiplication, the inputs are 32-bit 
floating-point numbers, and the result is a 40-bit floating-point number. When 
the multiplier performs integer multiplication, the input data is 24 bits and yields 
a 32-bit result. Refer to Chapter 4 for detailed information on data formats and 
floating-point operation. 

2.2.2 Arithmetic Logic Unit (ALU) 

The ALU performs single-cycle operations on 32-bit integer, 32-bit logical, and 
40-bit floating-point data, including single-cycle integer and floating-point con- 
versions. Results of the ALU are always maintained in 32-bit integer or 40-bit 
floating-point formats. The barrel shifter is used to shift up to 32 bits left or right 
in a single cycle. Refer to Chapter 4 for detailed information on data formats 
and floating-point operation. 

Internal buses, CPU1/CPU2 and REG1/REG2, carry two operands from 
memory and two operands from the register file, thus allowing parallel multi- 
plies and adds/subtracts on four integer or floating-point operands in a single 
cycle. 

2.2.3 Auxiliary Register Arithmetic Units (ARAUs) 

Two auxiliary register arithmetic units (ARAUO and ARAU1) can generate two 
addresses in a single cycle. The ARAUs operate in parallel with the multiplier 
and ALU. They support addressing with displacements, index registers (IRO 
and IR1), and circular and bit-reversed addressing. Refer to Chapter 5 for a 
description of addressing modes. 
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2.2.4 CPU Register File 

The TMS320C3x provides 28 registers in a multiport register file that is tightly 
coupled to the CPU. All of these registers can be operated upon by the multipli- 
er and ALU and can be used as general-purpose registers. However, the regis- 
ters also have some special functions. For example, the eight extended-preci- 
sion registers are especially suited for maintaining extended-precision float- 
ing-point results. The eight auxiliary registers support a variety of indirect ad- 
dressing modes and can be used as general-purpose 32-bit integer and logical 
registers. The remaining registers provide such system functions as address- 
ing, stack management, processor status, interrupts, and block repeat. Refer 
to Chapter 6 for detailed information and examples of stack management and 
register usage. 

The register names and assigned functions are listed in Table 2-1 . Following 
the table, the function of each register or group of registers is briefly described. 
Refer to Chapter 3 for detailed information on each of the CPU registers. 
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Table 2-1. CPU Registers 



Register 




Name 


Assigned Function 


nu 


E^vtanHAH.rM'AfMeifM'i ran iota r f\ 


R1 


Extended-precision register 1 


R2 


Extended-precision register 2 


R3 


Extended-precision register 3 


R4 


Extended-precision register 4 


R5 


Extended-precision register 5 


R6 


Extended-precision register 6 


R7 


Extended-precision register 7 


APO 
nnu 


Ai iviliorv ronictor H 
rMJAIIIcUy lOyiolOl \J 


AR1 


Auxiliary register 1 


AR2 


Auxiliary register 2 


AR3 


Auxiliary register 3 


AR4 


Auxiliary register 4 


AR5 


Auxiliary register 5 


AR6 


Auxiliary register 6 


AR7 


Auxiliary register 7 


DP 


Data-page pointer 


IRO 


Index register 0 


IR1 


Index register 1 


BK 


Block size 


SP 


System stack pointer 


QT 
O I 


oiaius register 


IE 


CPU/DMA interrupt enable 


IF 


CPU interrupt flags 


IOF 


I/O flags 


RS 


Repeat start address 


RE 


Repeat end address 


RC 


Repeat counter 



The extended-precision registers (R7-R0) are capable of storing and sup- 
porting operations on 32-bit integer and 40-bit floating-point numbers. Any in- 
struction that assumes the operands are floating-point numbers uses bits 
39-0. If the operands are either signed or unsigned integers, only bits 31-0 
are used; bits 39-32 remain unchanged. This is true for all shift operations. 
Refer to Chapter 4 for extended-precision register formats for floating-point 
and integer numbers. 

The 32-bit auxiliary registers (AR7-AR0) can be accessed by the CPU and 

modified by the two ARAUs. The primary function of the auxiliary registers is 
the generation of 24-bit addresses. They can also be used as loop counters 
or as 32-bit general-purpose registers that can be modified by the multiplier 
and ALU. Refer to Chapter 5 for detailed information and examples of the use 
of auxiliary registers in addressing. 
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The data page pointer (DP) is a 32-bit register. The eight LSBs of the data 
page pointer are used by the direct addressing mode as a pointer to the page 
of data being addressed. Data pages are 64K words long, with a total of 256 
pages. 

The 32-bit Index registers (IRO, IR1) contain the value used by the ARAU to 
compute an indexed address. Refer to Chapter 5 for examples of the use of 
index registers in addressing. 

The ARAU uses the 32-bit block size register (BK) in circular addressing to 
specify the data block size. 

The system stack pointer (SP) is a 32-bit register that contains the address 
of the top of the system stack. The SP always points to the last element pushed 
onto the stack. A push performs a preincrement of the system stack pointer; 
a pop performs a postdecrement. The SP is manipulated by interrupts, traps, 
calls, returns, and the PUSH and POP instructions. Refer to Section 5.5 for in- 
formation about system stack management. 

The status register (ST) contains global information relating to the state of the 
CPU. Operations usually set the condition flags of the status register accord- 
ing to whether the result is 0, negative, etc. This includes register load and 
store operations as well as arithmetic and logical functions. When the status 
register is loaded, however, a bit-for-bit replacement is performed with the con- 
tents of the source operand, regardless of the state of any bits in the source 
operand. Therefore, following a load, the contents of the status register are 
identical to the contents of the source operand. This allows the status register 
to be easily saved and restored. See Table 3-2 for a list and definitions of the 
status register bits. 

The CPU/DMA interrupt enable register (IE) is a 32-bit register. The CPU 
interrupt enable bits are in locations 10-0. The DMA interrupt enable bits are 
in locations 26-1 6. A 1 in a CPU/DMA interrupt enable register bit enables the 
corresponding interrupt. A 0 disables the corresponding interrupt. Refer to 
subsection 3.1 .8 for bit definitions. 

The CPU interrupt flag register (IF) is also a 32-bit register (see subsection 
3.1 .9). A 1 in a CPU interrupt flag register bit indicates that the corresponding 
interrupt is set. A 0 indicates that the corresponding interrupt is not set. 

The I/O flags register (IOF) controls the function of the dedicated external 
pins, XFO and XF1 . These pins may be configured for input or output and may 
also be read from and written to. See subsection 3.1 .1 0 for detailed informa- 
tion. 
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The repeat counter (RC) is a 32-bit register used to specify the number of 
times a block of code is to be repeated when performing a block repeat. When 
the processor is operating in the repeat mode, the 32-bit repeat start address 
register (RS) contains the starting address of the block of program memory 
to be repeated, and the 32-bit repeat end address register (RE) contains the 
ending address of the block to be repeated. 

The program counter (PC) is a 32-bit register containing the address of the 
next instruction to be fetched. Although the PC is not part of the CPU register 
file, it is a register that can be modified by instructions that modify the program 
flow. 
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2.3 Memory Organization 

The total memory space of the TMS320C3x is 1 6M (million) 32-bit words. Pro- 
gram, data, and I/O space are contained within this 1 6M-word address space, 
thus allowing tables, coefficients, program code, or data to be stored in either 
RAM or ROM. In this way, memory usage is maximized and memory space 
allocated as desired. 

2.3.1 RAM, ROM, and Cache 

Figure 2-3 shows how the memory is organized on the TMS320C3x. RAM 
blocks 0 and 1 are each 1 K x 32 bits. The ROM block, available only on the 
TMS320C30, is 4Kx32 bits. Each RAM and ROM block is capable of support- 
ing two CPU accesses in a single cycle. The separate program buses, data 
buses, and DMA buses allow for parallel program fetches, data reads and 
writes, and DMA operations. For example: the CPU can access two data val- 
ues in one RAM block and perform an external program fetch in parallel with 
the DMA loading another RAM block, all within a single cycle. 
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Figure 2-3. Memory Organization 
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A 64 x 32-bit instruction cache is provided to store often-repeated sections of 
code, thus greatly reducing the number of off-chip accesses necessary. This 
allows for code to be stored off-chip in slower, lower-cost memories. The exter- 
nal buses are also freed for use by the DMA, external memory fetches, or other 
devices in the system. 

Refer to Chapter 3 for detailed information about the memory and instruction 
cache. 
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2.3.2 Memory Maps 

The memory map depends on whether the processor is running in micropro- 
cessor mode (MC/MP or MCBL/MP = 0) or microcomputer mode (MC/MP or 
MCBL/MP = 1). The memory maps for these modes are similar (see 
Figure 2-4 and Figure 2-5). Locations 800000h-801 FFFh are mapped to the 
expansion bus. When this region, available only on the TMS320C30, is ac- 
cessed, MSTRB is active. Locations 802000h-803FFFh are reserved. Loca- 
tions 804000h-805FFFh are mapped to the exp ansion bu s. When this region, 
available only on the TMS320C30, is accessed, IOSTRB is active. Locations 
806000h-607FFFh are reserved. All of the memory-mapped peripheral bus 
registers are in locations 808000h-8097FFh. In both modes, RAM block 0 is 
located at addresses 809800h-809BFFh, and RAM block 1 is located at ad- 
dresses 809C00h-809FFFh. L ocation s 80A000h-OFFFFFFh are accessed 
over the external memory port (STRB active). 

In microprocessor mode, the 4K on-chip ROM (TMS320C30) or boot loader 
(TMS320C31) is not mapped into the TMS320C3x memory map. Locations 
Oh-OBFh consist of interrupt vector, trap vector, and reserv ed locations, all of 
which are accessed over the external memory port (STRB active). Locations 
0C0h-7FFFFFh are also accessed over the external memory port. 

In microcomputer mode, the 4K on-chip ROM (TMS320C30) or boot loader 
(TMS320C31) is mapped into locations Oh-OFFFh. There are 192 locations 
(0h~0BFh) within this block for interrupt vectors, trap vectors, and a reserved 
space (TMS320C30 ). Locat ions 1000h-7FFFFFh are accessed over the ex- 
ternal memory port (STRB active). 

Section 3.2 on page 3-13 describes the memory maps in greater detail and 
provides the peripheral bus map and vector locations for reset, interrupts, and 
traps. 



Be careful! Access to a reserved area produces unpredictable 
results, 
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Figure 2-4. TMS320C30 Memory Maps 
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Figure 2-5. TMS320C31 Memory Maps 
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2.3.3 Memory Addressing Modes 

The TMS320C3x supports a base set of general-purpose instructions as well 
as arithmetic-intensive instructions that are particularly suited for digital signal 
processing and other numeric-intensive applications. Refer to Chapter 5 for 
detailed information on addressing. 

Five groups of addressing modes are provided on the TMS320C3x. Six types 
of addressing can be used within the groups, as shown in the following list: 

□ General addressing modes: 

■ Register. The operand is a CPU register. 

■ Short immediate. The operand is a 1 6-bit immediate value. 

■ Direct. The operand is the contents of a 24-bit address. 

■ Indirect. An auxiliary register indicates the address of the operand. 

□ Three-operand addressing modes: 

■ Register. Same as for general addressing mode. 

■ Indirect. Same as for general addressing mode. 

□ Parallel addressing modes: 

■ Register. The operand is an extended-precision register. 

■ Indirect. Same as for general addressing mode. 

□ Long-immediate addressing mode. 

The Long-immediate operand is a 24-bit immediate value. 

□ Conditional branch addressing modes: 

■ Register. Same as for general addressing mode. 

■ PC-relative. A signed 1 6-bit displacement is added to the PC. 
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2.4 Instruction Set Summary 

Table 2-2 lists the TMS320C3x instruction set in alphabetical order. Each 
table entry shows the instruction mnemonic, description, and operation. Refer 
to Chapter 1 0 for a functional listing of the instructions and individual instruc- 
tion descriptions. 

Table 2-2. Instruction Set Summary 



Mnemonic 


Daftcrintion 


w|jvi an wi i 


ABSF 




Absolute value of a floating-point number 


|src| -+ Rn 


ABSI 




Absolute value of an integer 


|src| -* Dreg 


ADDC 




Add integers with carry 


src + Dreg + C -* Dreg 


ADDC3 




Add integers with carry (3 operand) 


srd + src2 + C Dreg 


ADDF 




Add floating-point values 


src + Rn -* Rn 


ADDF3 




Add floating-point values (3 operand) 


srd + src2 -* Rn 


ADDI 




Add integers 


src + Dreg -* Dreg 


ADDI3 




Add integers (3 operand) 


srd + src2 + -* Dreg 


AND 




Bitwise logical AND 


Dreg AND src -* Dreg 


AND3 




Bitwise logical AND (3 operand) 


srd AND src2 -* Dreg 


ANDN 




Bitwise logical AND with complement 


Dreg AND src -* Dreg 


ANDN3 




Bitwise logical ANDN (3 operand) 


srd AND src2 -* Dreg 


ASH 




Arithmetic shift 


If count * 0: 

(Shitted Dreg left by count) — * Dreg 
Else: 

(Shifted Dreg right by |count|) Dreg 


ASH3 




Arithmetic shift (3 operand) 


If count 2s 0: 

(Shifted src left by count) Dreg 
Else: 

lonineu src ngni oy |couni|j — * ureg 


Bcond 




Branch conditionally (standard) 


If cond = true: 
If Csrc is a register, Csrc -* PC 
If Csrc is a value, Csrc + PC PC 

Else, PC + 1 — PC 


BcondD 




Branch conditionally (delayed) 


If cond= true: 
If Csrc is a register, Csrc PC 
If Csrc is a value, Csrc + PC + 3 -* PC 

Else, PC + 1 — PC 


BR 




Branch unconditionally (standard) 


Value -* PC 


BRD 




Branch unconditionally (delayed) 


Value — PC 


CALL 




Call subroutine 


PC + 1 -> TOS 
Value -* PC 


Legend: 


C 


carry bit 


Csrc conditional-branch addressing modes 




cond 


condition code 


count shift value (general addressing modes) 




Dreg 


register address (any register) 


PC program counter 




Rn 


register address (R7-R0) 


src general addressing modes 




srd 


three-operand addressing modes 


src2 three-operand addressing modes 
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Table 2-2. Instruction Set Summary (Continued) 



Mnemonic 


Description 


Operation 


CALLcond 


Call subroutine conditionally 


If cond = true: 
PC + 1 -* TOS 

If Csrc is a register, Csrc -* PC 
If Csrc is a value, Csrc + PC -* PC 
Else, PC + 1 -* PC 


CMPF 




Compare floating-point values 


Set flags on Rn - src 


CMPF3 




Compare floating-point values 
(3 operand) 


Set flags on srd - src2 


CMPI 




Compare integers 


Set flags on Dreg - src 


CMPI3 




Compare integers (3 operand) 


Set flags on srd - src2 


DBcond 




Decrement and branch conditionally 
(standard) 


ARn - 1 -* ARn 
If cond = true and ARn as 0: 
If Csrc is a register, Csrc -+ PC 

If C ^rn is a ja C^sm + PC 4- 1 — * PO, 

Else, PC + 1 PC 


DBcondD 




Decrement and branch conditionally 
(delayed) 


ARn - 1 ARn 

If cond= true and ARn * 0: 
If Csrc is a register, Csrc -* PC 
If Csrc is a value, Csrc + PC + 3 -* PC 

Else, PC + 1 PC 


FIX 




Convert floating-point value to integer 


Fix (src) -* Dreg 


FLOAT 




Convert integer to floating-point value 


Float(src) -* Rn 


IACK 




Interrupt acknowledge 


Dummy read of src 

IACK toggled low, then high 


IDLE 




idle until interrupt 


PC + 1 —> PC 

Idle until next interrupt 


LDE 




Load floating-point exponent 


src(exponent) -* Rn(exponent) 


LDF 




uUdu Tiucuii ly-puii 11 VCMU6 


src — * Rn 


LDFcond 




Load floating-point value conditionally 


If cond- true, src -* Rn 
Else, Rn is not changed 


LDFI 




Load floating-point value, interlocked 


Signal interlocked operation src -* Rn 


LDI 




Load integer 


src -* Dreg 


LDlcond 




Load integer conditionally 


If cond= true, src -* Dreg 
Else, Dreg is not changed 


Legend: 


ARn 

Csrc 

cond 

Dreg 

PC 


auxiliary register n (AR7-AR0 
conditional-branch addressing modes 
condition code 

register address (any register) 
program counter 


Rn register address (R7 — R0) 
src general addressing modes 
srd three-operand addressing modes 
src2 three-operand addressing modes 
TOS top of stack 
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Table 2-2. Instruction Set Summary (Continued) 



Mnemonic 


Description 


Operation 


LDII 




Load integer, interlocked 


Signal interlocked operation src Dreg 


LDM 




Load floating-point mantissa 


src (mantissa) Rn (mantissa) 


LSH 




Logical shift 


If count 2r 0: 

(Dreg left-shifted by count) -* Dreg 
else: 

(Dreg right-shifted by |count|) -* Dreg 


LSH3 




Logical shift (3-operand) 


If count 0: 

(src left-shifted by count) Dreg 
Else: 

(src right-shifted by |count|) -* Dreg 


MPYF 




Multiply floating-point values 


src x Rn -* Rn 


MPYF3 




Multiply floating-point value (3 operand) 


srcl x src2 Rn 


MPYI 




Multiply integers 


src x Dreg -* Dreg 


MPYI3 




Multiply integers (3 operand) 


srd x src2 Dreg 


NEGB 




Negate integer with borrow 


0- src- C Dreg 


NEGF 




Negate floating-point value 


0 - src Rn 


NEGI 




Negate integer 


0-src-* Dreg 


NOP 




No operation 


Modify ARn if specified 


NORM 




Normalize floating-point value 


Normalize (src) -* Rn 


NOT 




Bitwise logical complement 


sire -* Dreg 


OR 




Bitwise logical OR 


Dreg OR src -» Dreg 


OR3 




Bitwise logical OR (3 operand) 


srtf OR src2 Dreg 


POP 




Pop integer from stack 


*SP Dreg 


POPF 




Pop floating-point value from stack 


*SP * Rn 


PUSH 




Push integer on stack 


Sreg -* *++ SP 


PUSHF 




Push floating-point value on stack 


Rn -* *++ SP 


Legend: 


ARn 
C 

Dreg 

PC 

Rn 


auxiliary register n (AR7-AR0) 
carry bit 

register address (any register) 
program counter 
register address (R7-R0) 


SP stack pointer 
Sreg register address (any register) 
src general addressing modes 
srd 3-operand addressing modes 
src2 3-operand addressing modes 
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Table 2-2. Instruction Set Summary (Continued) 



Mnemonic 


Description 


Operation 


RET\cond 


Return from interrupt conditionally 


If cond- true or missing: 








*SP > pc 








1 -* ST (GIE) 








else, connnue 


RETScond 


Rpturn from <»iihrniitine rnnHitinnallx/ 

llwlvll 1 1 II Will 9UMlwvlllllw V/wl IvIIMUI ICMI Y 


If ennti — triiA or miQQinrr 

11 IsVsf l*J — 11 Uv \J\ IIIIOOIIiy> 








*SP ► PC 








cise, continue 


HNU 




nuunu Tioaiiny-poiru vaiu© 


Round (sre) — * Rn 


ROL 




Rotate left 


Dreg rotated left 1 bit Dreg 


nULv 




Rotate left through carry 


Dreg rotated left 1 bit through carry — * Dreg 


ROR 




notate ngnt 


Dreg rotated right 1 bit Dreg 


RORC 




Rotate right through carry 


DrAfi rntataH rinht 1 hit throunh rariv — ► HrAn 


RPTB 




Repeat block of instructions 


sre -* RE 
















Next PC -* RS 


RPTS 




Repeat single instruction 


sre -* RC 








1 ST (RIA) 








Next PC -* RS 








Next PC -* RE 


SIGI 




Signal, interlocked 


Signal interlocked operation 








Wail Tor interiocK aCKnuwieuge 








Clear interlock 


STF 




QtrtrA flftatinri-rvtint \#oli ia 
OlUlt? IIUcUlliy-pUllH V0IU6 


Hn — * uaoor 


oTFI 




Store floating-point value, interlocked 


Rn -* Daddr 








Signal end of interlocked operation 


STI 




Store integer 


Sreg Daddr 


STII 




oiore integer, inxeriocKea 


Sreg — * Daddr 








Signal end of interlocked operation 


SUBB 




Subtract integers with borrow 


Drea — sre — C — * Drea 


Legend: 


C 


carry bit 


RM repeat mode bit 




cond 


condition code 


RS repeat start register 




Daddr 


destination memory address 


Rn register address (R7-R0) 




Dreg 


register address (any register) 


SP stack pointer 




GIE 


global interrupt enable register 


ST status register 




PC 


program counter 


Sreg register address (any register) 




RC 


repeat counter register 


sre general addressing modes 




RE 


repeat interrupt register 
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Table 2-2. Instruction Set Summary (Concluded) 



Mnemonic Description 



Operation 



SUBB3 
SUBC 



SUBF 

SUBF3 

SUBI 

SUBI3 

SUBRB 

SUBRF 

SUBRI 

SWI 

TRAP cond 



TSTB 
TSTB3 
XOR 
XOR3 



Subtract integers with borrow (3 operand) 
Subtract integers conditionally 

Subtract floating-point values 

Subtract floating-point values (3 operand) 

Subtract integers 

Subtract integers (3 operand) 

Subtract reverse integer with borrow 

Subtract reverse floating-point value 

Subtract reverse integer 

Software interrupt 

Trap conditionally 



Test bit fields 

Test bit fields (3 operand) 

Bitwise exclusive OR 

Bitwise exclusive OR (3 operand) 



srcl - src2 - C Dreg 

If Dreg - src 2 0: 

[(Dreg - src) « 1] OR 1 -* Dreg 

Else, Dreg « 1 -* Dreg 

Rn - src -* Rn 

srd - srdl -* Rn 

Dreg - src -* Dreg 

srcl - src2 -+> Dreg 

src- Dreg -C -* Dreg 

src-Rn -* Rn 

src- Dreg Dreg 

Perform emulator interrupt sequence 

If cond = true or missing: 

NextPC-»*++ SP 

Trap vector N -* PC 

0-*ST(GIE) 
Else, continue 

Dreg AND src 

srcl AND src2 

Dreg XOR src Dreg 

srd XOR src2 -* Dreg 



Legend: C carry bit 

cond condition code 

Dreg register address (any register) 

GIE global interrupt enable register 

N any trap vector 0-27 

PC program counter 



Rn register address (R7-R0) 

SP stack pointer 

src general addressing modes 

src\ 3-operand addressing modes 

src2 3-operand addressing modes 

ST status register 
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2.5 Internal Bus Operation 

Much of the TMS320C3x's high performance is due to internal busing and par- 
allelism. The separate program buses (PADDR and PDATA), data buses 
(DADDR1, DADDR2, and DDATA), and DMA buses (DMAADDR and 
DMADATA) allow for parallel program fetches, data accesses, and DMA ac- 
cesses. These buses connect all of the physical spaces (on-chip memory, 
off-chip memory, and on-chip peripherals) supported by the TMS320C30. 
Figure 2-3 shows these internal buses and their connection to on-chip and off- 
chip memory blocks. 

The PC is connected to the 24-bit program address bus (PADDR) . The instruc- 
tion register (IR) is connected to the 32-bit program data bus (PDATA). These 
buses can fetch a single instruction word every machine cycle. 

The 24-bit data address buses (DADDR1 and DADDR2) and the 32-bit data 
data bus (DDATA) support two data memory accesses every machine cycle. 
The DDATA bus carries data to the CPU over the CPU1 and CPU2 buses. The 
CPU1 and CPU2 buses can carry two data memory operands to the multiplier, 
ALU, and register file every machine cycle. Also internal to the CPU are regis- 
ter buses REG1 and REG2, which can carry two data values from the register 
file to the multiplier and ALU every machine cycle. Figure 2-2 shows the buses 
internal to the CPU section of the processor. 

The DMA controller is supported with a 24-bit address bus (DMAADDR) and 
a 32-bit data bus (DMADATA) . These buses allow the DMA to perform memory 
accesses in parallel with the memory accesses occurring from the data and 
program buses. 
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2.6 Parallel Instruction Set Summary 

Table 2-3 lists the 'C3x instruction set in alphabetical order. Each table entry 
shows the instruction mnemonic, description, and operation. Refer to Section 
10.3 on page 10-14 for a functional listing of the instructions and individual 
instruction descriptions. 
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Table 2-3. Parallel Instruction Set Summary 



Mnemonic Description 



Operation 



ABSF 
||STF 

ABSI 

II an 

ADDF3 
||STF 

ADDI3 
||STI 

AND3 

II sn 

ASH3 
IISTI 



FIX 
II STI 

FLOAT 
II STF 

LDF 
II STF 

LDI 
IISTI 

LSH3 
IISTI 



MPYF3 
|| STF 

MPYI3 
IISTI 



Parallel Arithmetic With Store Instructions 

Absolute value of a floating point 



Absolute value of an integer 
Add floating point 
Add integer 
Bitwise logical AND 
Arithmetic shift 



Convert floating point to integer 
Convert integer to floating point 
Load floating point 
Load integer 
Logical shift 



Multiply floating point 



Multiply integer 



src2\ -* dsn 
| srcQ ds12 



$rc2\ - 
I $rC3- 



dsn 
>ds& 



srd + src2 -+ dsn 
|| src3 -* dst2 

srd + src2 -*• dsn 
|| srcQ -* dstZ 

srd AND srdZ -> dst\ 
|| srcQ -* dst2 

If count ^ 0: 

src2 « count dsn 

|| src3 -* dstZ 

Else: 

src2 » |count| -+ dst\ 
|| srcQ dst2 

F\x(src2) -* dst\ 
|| src3 -* ds12 

Float(src2) -* dsn 
|| srd3 -+ dst2 

src2 -* dsn 
|| srcQ -* ds12 

src2 -+ dsn 
|| src3 -+ dstZ 

If count * 0: 
src2 « count dsn 
|| src3 — ds12 
Else: 

src2 » |count| dsn 
|| src3 -+ dsH 

srd x src2 -* dsn 
|| srcQ -* cfeC 

srd x src2 -* cfsfl 
II src3 -* d$t2 



Legend: count register addr (R7-R0) 
dsfl register addr (R7-R0) 
dstZ indirect addr (disp = 0, 1 , IRO, IR1) 



srd register addr (R7-R0) 

src2 indirect addr (disp = 0, 1, IRO, IR1) 

src3 register addr (R7-R0) 



2-24 



Parallel Instruction Set Summary 



Table 2-3. Parallel Instruction Set Summary (Continued) 



Mnemonic Description Operation 

Parallel Arithmetic With Store Instructions (Concluded) 

NEGF Negate floating point 0- src2. -» dst\ 

II STF || S rc3 dsQ 

NEGI Negate integer 0 - src2 -* dsn 

II STI || src3 — dstZ 

NOT Complement srcl -* dst\ 

II STI || S rc3 -+ dst2 

OR3 Bitwise logical OR srd OR $rc2 -+ dst) 

II STI || src3 — ds£2 

STF Store floating point srd c/sfl 

II STF || S rc3 -> c/s/2 

STI Store integer srcl -» cfefl 

II STI || src3 -* cfe£ 

SUBF3 Subtract floating point srd - src2 -* cfsfl 

II STF || srC 3 -* dsC 

SUBI3 Subtract integer srd - src2 -> tfsfl 

II STI || src3 -* cfsfi 

XOR3 Bitwise exclusive OR srd XOR src2 -* dsn 

II STI || S rc3 dsQ 

Parallel Load Instructions 

LDF Load floating point src2 -* dst\ 

II LDF || srcA -> dsQ 

LDI Load integer src2 -* ctefl 

II LDI || srC 4 -* dsQ 

Parallel Multiply And Add/Subtract Instructions 

MPYF3 Multiply and add floating point op1 x op2 -* op3 

II ADDF3 || 0 p4 + op5 op6 

MPYF3 Multiply and subtract floating point op1 x op2 -> op3 

|| SUBF3 ||op4-op5-*op6 

MPYI3 Multiply and add integer op1 x op2 ~> op3 

II ADDI3 || 0 p4 + op5 -* op6 

MPYI3 Multiply and subtract integer op1 x op2 -* op3 

II SUBI3 ||op4-op5-»op6 

Legend: dst\ register addr (R7-R0) op3 register addr (R0 or R1) 

dst2 indirect addr (disp = 0, 1 , IRO, IR1 ) op6 register addr (R2 or R3) 

op1 , op2, op4, and op5 Any two of these srd register addr (R7-R0) 

operands must be specified using src2 indirect addr (disp = 0, 1 , IRO, IR1) 

register addr; the remaining two src3 register addr (R7-R0) 
must be specified using indirect. 
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2.7 External Bus Operation 

The TMS320C30 provides two external interfaces: the primary bus and the ex- 
pansion bus. The TMS320C31 provides one external interface: the primary 
bus. Both primary and expansion buses consist of a 32-bit data bus and a set 
of control signals. The primary bus has a 24-bit address bus, whereas the ex- 
pansion bus has a 1 3-bit address bus. Both buses can be used to address ex- 
ternal program/data memory or I/O space. The buses also have an external 
RDY signal for wait-state generation. You can insert additional wait states un- 
der software control. Refer to Chapter 7 for detailed information on external 
bus operation. 



2.7.1 External Interrupts 

The TMS320C3x supports four external interru pts (I NT3 -I NTO) , a number of 
internal interrupts, and a nonmaskable external RESET signal. These can be 
used to interrupt either the DMA or the CPU. When the CPU responds to the 
interrupt, the IACK pin can be used to signal an e xternal i nterrupt acknowl- 
edge. Section 6.5 (beginning on page 6-18) covers RESET and interrupt pro- 
cessing. 



2.7.2 Interlocked-lnstruction Signaling 

Two external I/O flags, XFO and XF1 , can be configured as input or output pins 
under software control. These pins are also used by the interlocked operations 
of the TMS320C3X. The interlocked-operations instruction group supports 
multiprocessor communication (see Section 6.4 on page 6-1 2 for examples of 
the use of interlocked instructions). 
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2.8 Peripherals 



All TMS320C3x peripherals are controlled through memory-mapped registers 
on a dedicated peripheral bus. This peripheral bus is composed of a 32-bit data 
bus and a 24-bit address bus. This peripheral bus permits straightforward 
communication to the peripherals. The TMS320C3x peripherals include two 
timers and two serial ports (only one serial port is available on the 
TMS320C31). Figure 2-6 shows the peripherals with associated buses and 
signals. Refer to Chapter 8 for detailed information on the peripherals. 



Figure 2-6. Peripheral Modules 




Available on TMS320C30 
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2.8.1 Timers 

The two timer modules are general-purpose 32-bit timer/event counters with 
two signaling modes and internal or external clocking. Each timer has an I/O 
pin that can be used as an input clock to the timer or as an output signal driven 
by the timer. The pin can also be configured as a general-purpose I/O pin. 

2.8.2 Serial Ports 

The two bidirectional serial ports are totally independent. They are identical to 
a complementary set of control registers that control each port. Each serial 
port can be configured to transfer 8, 16, 24, or 32 bits of data per word. The 
clock for each serial port can originate either internally or externally. An inter- 
nally generated divide-down clock is provided. The serial port pins are confi- 
gurable as general-purpose I/O pins. The serial ports can also be configured 
as timers. A special handshake mode allows TMS320C3xs to communicate 
over their serial ports with guaranteed synchronization. 
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2.9 Direct Memory Access (DMA) 

The on-chip DMA controller can read from or write to any location in the 
memory map without interfering with the operation of the CPU. Therefore, the 
TMS320C3x can interface to slow external memories and peripherals without 
reducing throughput to the CPU. The DMA controller contains its own address 
generators, source and destination registers, and transfer counter. Dedicated 
DMA address and data buses minimize conflicts between the CPU and the 
DMA controller. A DMA operation consists of a block or single-word transfer 
to or from memory. Refer to Section 8.3 on page 8-43 for detailed information 
on the DMA controller. Figure 2-7 shows the DMA controller with associated 
buses. 



Figure 2-7. DMA Controller 



/ DMADATA Bus 7 



/ DMAAPPR Bus / 



DMA Controller 


Global Control Register 




Source Address Register 




Destination Address Register 




Transfer Counter Register 
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2.10 TMS320C30 and TMS320C31 Differences 



This section addresses the major memory access differences between the 
TMS320C31 and the TMS320C30 devices. Observance of these consider- 
ations is critical for achieving design goal success. 

Table 2-4 shows these differences, which are detailed in the following subsec- 
tions. 



Table 2-4. Feature Set Comparison 



Feature 



TMS320C31 



TMS320C30 



Data/program bus 



Serial I/O ports 

User program/data ROM 

Program boot loader 



Primary bus: one bus composed of Two buses: 

a 32-bit data and a 24-bit address • Primary bus: a 32-bit data and a 

bus 24-bit address 

• Expansion bus: a 32-bit data and 
a 13-bit address 



1 serial port (SP0) 
Not available 
User selectable 



2 serial ports (SP0, SP1) 
4K words/1 6K bytes 
Not available 



2.10.1 Data/Program Bus Differences 

The TMS320C31 uses only the primary bus and reserves the memory space 
that was previously used for expansion bus operations. 



Be careful! Program access to a reserved area produces 
unpredictable results. 



2.10.2 Serial-Port Differences 

Serial port 1 references in Section 8.2 are not applicable to the TMS320C31 . 
The memory locations identified for the associated control registers and buff- 
ers are reserved. 

2.10.3 Reserved Memory Locations 

Table 2-5 identifies TMS320C31 reserved memory locations in addition to 
those shown in Figure 3-8 on page 3-16. 
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Table 2-5. TMS320C31 Reserved Memory Locations 



Feature 


TMS320C31 


TMS320C30 


OxOOOOOO-OxOOOFFF 


Reservedt 


Microcomputer program/data ROM modet 


0x800000-0x801 FFF 


Reserved 


Expansion bus MSTRB space 


Ox804000-Ox805FFF 


Reserved 


Expansion bus IOSTRB space 


0x808050 


Reserved 


SP1 global-control register 


0x808052-0x808056 


Reserved 


SP1 local-control registers 


0x808058 


Reserved 


SP1 data-transmit buffer 


0X80805C 


Reserved 


SP1 receive-transmit buffer 


0x808060 


Reserved 


Expansion bus control register 


T Applies to the MCBL and MC modes only. 



2.10.4 Effects on the IF and IE Interrupt Registers 

The bits associated with serial port 1 in the IE (interrupt enable) register and 
the IF (interrupt flag) register for the TMS320C30 are not applicable to the 
TMS320C31 . Write only logic 0 data to IE register bits 6, 7, 22, and 23 and to 
IF register bits 6 and 7. Writing logic 1s to these bits produces unpredictable 
results. 

2.10.5 User Program/Data ROM 

The user program/data ROM that is available for the TMS320C30 device does 
not exist for the TMS320C31. Rather, the memory locations that were allo- 
cated to support user program/data ROM operations have been reserved on 
theTMS320C31 to support microcomputer/boot loader accessing. See 
Chapter 3 for more information on using the microcomputer/boot loader func- 
tion. 

2.10.6 Development Considerations 

If you are developing application code using a TMS320C3x simulator, XDS, 
or ASM/LNK, Tl recommends that you modify the .cfm and .cmd files by re- 
moving these memory spaces from the tool's configured memory. This 
ensures that your developed application performs as expected when the 
TMS320C31 device is used. 
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2.11 System Integration 

In summary, the TMS320C3x is a powerful DSP system that integrates an in- 
novative, high-performance CPU, two external interface ports, large memo- 
ries, and efficient buses to support its speed. A single chip contains this sys- 
tem, along with peripherals such as a DMA controller, two serial ports, and two 
timers. The TMS320C3x system is truly an affordable single-chip solution. 
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CPU Registers, Memory, and Cache 



The central processing unit (CPU) register file contains 28 registers that can 
be operated on by the multiplier and arithmetic logic unit (ALU). Included in the 
register file are the auxiliary registers, extended-precision registers, and index 
registers. The registers in the CPU register file support addressing, float- 
ing-point/integer operations, stack management, processor status, block re- 
peats, and interrupts. 

The TMS320C3x provides a total memory space of 1 6M (million) 32-bit words 
containing program, data, and I/O space. Two RAM blocks of 1 Kx 32 bits each 
and a ROM block of 4K x 32 bits (available only on the TMS320C30) permit 
two CPU accesses in a single cycle. The memory maps for the microcomputer 
and microprocessor modes are similar, except that the on-chip ROM is not 
used in the microprocessor mode. 

A 64- x 32-bit instruction cache stores often-repeated sections of code. This 
greatly reduces the number of off-chip accesses and allows code to be stored 
off-chip in slower, lower-cost memories. Three bits in the CPU status register 
control the clear, enable, or freeze of the cache. 

This chapter describes in detail each of the CPU registers, the memory maps, 
and the instruction cache. Major topics are as follows: 



Topic Page 



3.1 






3.2 






3.3 






3.4 


Using the TMS320C31 Boot Loader 
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3.1 CPU Register File 

The TMS320C3x provides 28 registers in a multiport register file that is tightly 
coupled to the CPU. The program counter (PC) is not included in the 28 regis- 
ters. All of these registers can be operated on by the multiplier and the ALU 
and can be used as general-purpose 32-bit registers. However, the registers 
also have some special functions for which they are particularly appropriate. 
For example, the eight extended-precision registers are especially suited for 
maintaining extended-precision floating-point results. The eight auxiliary reg- 
isters support a variety of indirect addressing modes and can be used as gen- 
eral-purpose 32-bit integer and logical registers. The remaining registers pro- 
vide system functions, such as addressing, stack management, processor 
status, interrupts, and block repeat. Refer to Chapter 5 for detailed information 
and examples of the use of CPU registers in addressing. 

Table 3-1 lists the registers names and assigned functions. 

Table 3-1. CPU Registers 



DamIbIad 

riBQISlOr 


Assignoa rune lion iMamo 


RO 


Extended-precision register 0 


R1 


Extended-precision register 1 


R2 


Extended-precision register 2 


R3 


Extended-precision register 3 


R4 


Extended-precision register 4 


R5 


Extended-precision register 5 


R6 


Extended-precision register 6 


R7 


Extended-precision register 7 


ARO 


Auxiliary register 0 


AR1 


Auxiliary register 1 


AR2 


Auxiliary register 2 


AR3 


Auxiliary register 3 


AR4 


Auxiliary register 4 


AR5 


Auxiliary register 5 


AR6 


Auxiliary register 6 


AR7 


Auxiliary register 7 


DP 


Data-page pointer 


IRO 


Index register 0 


IR1 


Index register 1 


BK 


Block-size register 


SP 


System stack pointer 


ST 


Status register 


IE 


CPU/DMA interrupt enable 


IF 


CPU interrupt flags 


IOF 


I/O flags 


RS 


Repeat start address 


RE 


Repeat end address 


RC 


Repeat counter 
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3.1 .1 Extended-Precision Registers (R7-R0) 

The eight extended-precision registers (R7-R0) are capable of storing and 
supporting operations on 32-bit integer and 40-bit floating-point numbers. 
These registers consist of two separate and distinct regions: 

□ bits 39-32: dedicated to storage of the exponent (e) of the floating-point 
number. 

□ bits 31-0: store the mantissa of the floating-point number: 

■ bit 31 : sign bit (s) 

■ bits 30-0: the fraction (f) 

Any instruction that assumes the operands are floating-point numbers uses 
bits 39-0. Figure 3-1 illustrates the storage of 40-bit floating-point numbers 
in the extended-precision registers. 

Figure 3-1. Extended-Precision Register Floating-Point Format 

39 32 31 30 0 



fraction < 



mantissa 



For integer operations, bits 31-0 of the extended-precision registers contain 
the integer (signed or unsigned). Any instruction that assumes the operands 
are either signed or unsigned integers uses only bits 31-0. Bits 39-32 remain 
unchanged. This is true for all shift operations. The storage of 32-bit integers 
in the extended-precision registers is shown in Figure 3-2. 



Figure 3-2. Extended-Precision Register Integer Format 



39 



32 31 



unchanged 



signed or unsigned integer 



3.1 .2 Auxiliary Registers (AR7-AR0) 

The eight 32-bit auxiliary registers (AR7-AR0) can be accessed by the CPU 
and modified by the two Auxiliary Register Arithmetic Units (ARAUs). The pri- 
mary function of the auxiliary registers is the generation of 24-bit addresses. 
However, they can also be used as loop counters in indirect addressing or as 
32-bit general-purpose registers that can be modified by the multiplier and 
ALU. Refer to Chapter 5 for detailed information and examples of the use of 
auxiliary registers in addressing. 
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3.1 .3 Data-Page Pointer (DP) 

The data-page pointer (DP) is a 32-bit register that is loaded using the LDP 
instruction. The eight LSBs of the data-page pointer are used by the direct ad- 
dressing mode as a pointer to the page of data being addressed. Data pages 
are 64K words long, with a total of 256 pages. Bits 31-8 are reserved; you 
should always keep these set to 0 (cleared). 

3.1 .4 Index Registers (IRO, IR1) 

The 32-bit index registers (IRO and IR1) are used by the ARAL) for indexing 
the address. Refer to Chapter 5 for detailed information and examples of the 
use of index registers in addressing. 

3.1 .5 Block Size Register (BK) 

The 32-bit block size register (BK) is used by the ARAU in circular addressing 
to specify the data block size (see Section 5.3 on page 5-24). 

3.1 .6 System Stack Pointer (SP) 

The system stack pointer (SP) is a 32-bit register that contains the address of 
the top of the system stack. The SP always points to the last element pushed 
onto the stack. The SP is manipulated by interrupts, traps, calls, returns, and 
the PUSH, PUSHF, POP, and POPF instructions. Pushes and pops of the 
stack perform preincrement and postdecrement, respectively, on all 32 bits of 
the stack pointer. However, only the 24 LSBs are used as an address. Refer 
to Section 5.5 on page 5-31 for information about system stack management. 

3.1 .7 Status Register (ST) 

The status register (ST) contains global information relating to the state of the 
CPU. Operations usually set the condition flags of the status register accord- 
ing to whether the result is 0, negative, etc. This includes register load and 
store operations as well as arithmetic and logical functions. When the status 
register is loaded, however, the contents of the source operand replace the 
current contents bit-for-bit, regardless of the state of any bits in the source op- 
erand. Therefore, following a load, the contents of the status register are iden- 
tically equal to the contents of the source operand. This allows the status regis- 
ter to be saved easily and restored. At system reset, 0 is written to this register. 
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Figure 3-3 shows the format of the status register. Table 3-2 defines the sta- 
tus register bits, their names, and their functions. 

Figure 3-3. Status Register 

31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 18 
xx I xxl xx|xx|xx|xx|xx|xx| XX I XX I XX I XX I XX I XX I XX I XX I 



15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 

| xx | xx | gie| cc | ce | cf | xx | rm |ovm| luf| lv | uf | n [ z | v | c | 

R/WR/WR/WR/W R/WR/WR/WR/WR/WR/WR/WR/WR/W 

Notes: 1) xx = reserved bit, read as 0 
2) R = read, W = write 
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Table 3-2. Status Register Bits Summary 



Bit 


Name 


Reset Value 


Function 


nt 




0 


oarry nag 




V 


0 


Overflow flag 




■7 


U 


zero Tiag 


3t 


N 


0 


Negative flag 


4t 


UF 


0 


Floating-point underflow flag 


5t 


LV 


0 


Latched overflow flag 


6t 


LUF 


0 


Latched floating-point underflow flag 


7 


OVM 


0 


Overflow mode flag. This flag affects only the integer operations. If OVM 
= 0, the overflow mode is turned off; integer results that overflow are 
treated in no special way. If OVM = 1 , 

a) integer results overflowing in the positive direction are set to the 
most positive 32-bit twos-complement number (7FFFFFFFh), and 

b) integer results overflowing in the negative direction are set to the 
most negative 32-bit twos-complement number (80000000h). 

Note that the function of V and LV is independent of the setting of OVM . 


8 


RM 


0 


Repeat mode flag. If RM = 1 , the PC is being modified in either the 
repeat-block or repeat-single mode. 


9 


Reserved 


0 


Read as 0 


10 


CF 


0 


Cache freeze. When CF = 1, the cache is frozen. If the cache is enabled 
(CE = 1 ), fetches from the cache are allowed, but no modification of the 
state of the cache is performed. This function can be used to save fre- 
quently used code resident in the cache. At reset, 0 is written to this bit. 
Cache clearing (CC = 1) is allowed when CF = 0. 


11 


CE 


0 


Cache enable. CE = 1 enables the cache, allowing the cache to be used 
according to the least recently used (LRU) cache algorithm. CE = 0 dis- 
ables the cache; no update or modification of the cache can be per- 
formed. No fetches are made from the cache. This function is useful for 
system debugging. At system reset, 0 is written to this bit. Cache clear- 
ing (CC = 1) is allowed when CE = 0. 


12 


CC 


0 


Cache clear. CC = 1 invalidates all entries in the cache. This bit is always 
cleared after it is written to and thus always read as 0. At reset, 0 is writ- 
ten to this bit. 


13 


GIE 


0 


Global interrupt enable. If GIE = 1 , the CPU responds to an enabled in- 
terrupt. If GIE = 0, the CPU does not respond to an enabled interrupt. 


15-14 


Reserved 


0 


Read as 0 


31-16 


Reserved 


0-0 


Value undefined 



tfhe seven condition flags (ST bits 6-0) are defined in Section 10.2 on page 10-10. 
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3.1 .8 CPU/DMA Interrupt Enable Register (IE) 

The CPU/DMA interrupt enable register (IE) is a 32-bit register (see 
Figure 3-4). The CPU interrupt enable bits are in locations 10-0. The direct 
memory access (DMA) interrupt enable bits are in locations 26-16. A 1 in a 
CPU/DMA IE register bit enables the corresponding interrupt. A 0 disables the 
corresponding interrupt. At reset, 0 is written to this register. Table 3-3 defines 
the register bits, the bit names, and the bit functions. 



Figure 3-4. CPU/DMA Interrupt Enable Register (IE) 



31 


30 


29 


28 


27 


26 


25 


24 




23 


22 


21 


20 


19 


16 


17 


16 


XX 


XX 


XX 


XX 


XX 


EDINT 


ETINT1 


ETINTO 


ERINT1 


EXINT1 


ERINTO 


EXINTO 


EINT3 


EINT2 


EINT1 


EINTO 


(DMA) 


(DMA) 


(DMA) 


(DMA) 


(DMA) 


(DMA) 


(DMA) 


(DMA) 


(DMA) 


(DMA) 


(DMA) 












R/W 


R/W 


R/W 




R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


1 ? 


14 


1 ? 


12 


11 


19 


? 


8 






, <? , 


5 




, ? ■ 


■ I ■ 


, L_ 


_2_ 


XX 


XX 


XX 


XX 


XX 


EDINT 


ETINT1 


ETINTO 


ERINT1 


EXINT1 


ERINTO 


EXINTO 


EINT3 


EINT2 


EINT1 


EINTO 


(CPU) 


(CPU) 


(CPU) 




(CPU) 


(CPU) 


(CPU) 


(CPU) 


(CPU) 


(CPU) 


(CPU) 


(CPU) 












R/W 


R/W 


R/W 




R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 



Notes: 1) xx = reserved bit, read as 0 
2) R = read, W = write 
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Table 3-3. IE Register Bits Summary 



Bit 


N&IY16 




Function 

1 Ml IWIIVI 1 


0 


EINTO 


0 


Enable external interrupt 0 (CPU) 


1 


EINT1 


0 


Enable external interrupt 1 (CPU) 


2 


EINT2 


0 


Enable external interrupt 2 (CPU) 


3 


EINT3 


0 


Enable external interrupt 3 (CPU) 


4 


EXINTO 


0 


Enable serial-port 0 transmit interrupt (CPU) 


5 


ERINTO 


0 


Enable serial-port 0 receive interrupt (CPU) 


6 


EXINT1 


0 


Enable serial-port 1 transmit interrupt (CPU) 


7 


ERINT1 


0 


Enable serial-port 1 receive interrupt (CPU) 


8 


ETINTO 


0 


Enable timer 0 interrupt (CPU) 


9 


ETINT1 


0 


Enable timer 1 interrupt (CPU) 


10 


EDINT 


0 


Enable DMA controller interrupt (CPU) 


15-11 


Reserved 


0 


Value undefined 


16 


EINTO 


0 


Enable external interrupt 0 (DMA) 


17 


EINT1 


0 


Enable external interrupt 1 (DMA) 


18 


EINT2 


0 


Enable external interrupt 2 (DMA) 


19 


EINT3 


0 


Enable external interrupt 3 (DMA) 


20 


EXINTO 


0 


Enable serial-port 0 transmit interrupt (DMA) 


21 


ERINTO 


0 


Enable serial-port 0 receive interrupt (DMA) 


22 


EXINT1 


0 


Enable serial-port 1 transmit interrupt (DMA) 


23 


ERINT1 


0 


Enable serial-port 1 receive interrupt (DMA) 


24 


ETINTO 


0 


Enable timer 0 interrupt (DMA) 


25 


ETINT1 


0 


Enable timer 1 interrupt (DMA) 


26 


EDINT 


0 


Enable DMA controller interrupt (DMA) 


31-27 


Reserved 


0-0 


Value undefined 
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3.1.9 CPU Interrupt Flag Register (IF) 

Figure 3-5 shows the 32-bit CPU interrupt flag register (IF). A 1 in a CPU IF 
register bit indicates that the corresponding interrupt is set. The IF bits are set 
to 1 when an interrupt occurs. They may also be set to 1 through software to 
cause an interrupt. A 0 indicates that the corresponding interrupt is not set. If 
a 0 is written to an IF register bit, the corresponding interrupt is cleared. At re- 
set, 0 is written to this register. Table 3-4 lists the bit fields, bit-field names, and 
bit-field functions of the CPU IF register. 

Figure 3-5. CPU Interrupt-Flag Register (IF) 

31 29 27 26 25 24 23 22 21 20 19 18 17 18 
|xx|xx|xx|xx|xx| xx I xx | xx I xx | xx I xx | xx I xx I xx I xx I xxl 
30 28 



, 15 . , 13 . 10 L 9 . 8 7 6 8 4 . 3 . 2 . 1 . 0 . 

14 12 R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W 



Notes: 1) xx = reserved bit, read as 0 
2) R = read, W = write 



Table 3-4. IF Register Bits Summary 



Bit 


Name 


Reset Value 


Function 


0 


INTO 


0 


External interrupt 0 flag 


1 


INT1 


0 


External interrupt 1 flag 


2 


INT2 


0 


External interrupt 2 flag 


3 


INT3 


0 


External interrupt 3 flag 


4 


XINTO 


0 


Serial-port 0 transmit interrupt flag 


5 


RINTO 


0 


Serial-port 0 receive interrupt flag 


6 


XINTlt 


0 


Serial-port 1 transmit interrupt flag 


7 


RINTlt 


0 


Serial-port 1 receive interrupt flag 


8 


TINTO 


0 


Timer 0 interrupt flag 


9 


TINT1 


0 


Timer 1 interrupt flag 


10 


DINT 


0 


DMA channel interrupt flag 


31-11 


Reserved 


0-0 


Value undefined 



t Reserved on TMS320C31 
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3.1.10 I/O Flags Register (IOF) 

The I/O flags register (IOF) is shown in Figure 3-6 and controls the function 
of the dedicated external pins, XFO and XF1 . These pins can be configured for 
input or output. The pins can also be read from and written to. At reset, 0 is 
written to this register. Table 3-5 shows the bit fields, bit-field names, and bit- 
field functions. 



Figure 3-6. 1/O-Flag Register (IOF) 



31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 

I xx | xx |xx| xx | xx | xx |xx| 



ESS ESS ESS BS ES E33 E ESS 



XX 



15 14 13 12 11 10 9 8 



Bfl B3 BBBB BH B3 Bfl PS ■ i a m a ■ [>m mail Ktxffli BS BR j»m BJi »n a »i I riVta jti PS 



R/w 



5. 

R/W 



j3_ 

R 



2 
R/W 



J 0 

R/w" 



Notes: 1) xx = reserved bit, read as 0 
2) R = read, W = write 
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Table 3-5. IOF Register Bits Summary 



Bit 


Name 


Reset Value 


Function 


0 


Reserved 


0 


Read as 0 


1 


l/OXFO 


0 


If j/OXFO = 0, XFO is configured as a general-purpose input pin. 
If l/OXFO = 1 , XFO is configured as a general-purpose output pin. 


2 


OUTXFO 


0 


Data output on XFO 


3 


INXFO 


0 


Data input on XFO. A write has no effect. 


4 


Reserved 


0 


Read as 0 


5 


T/OXF1 


0 


If I/OXF1 = 0, XF1 is configured as a general-purpose input pin. 
If T/OXF1 = 1 , XF1 is configured as a general-purpose output pin. 


6 


OUTXF1 


0 


Data output on XF1 


7 


INXF1 


0 


Data input on XF1 . A write has no effect. 


31-8 


Reserved 


0-0 


Read as 0 



3.1.11 Repeat-Count (RC) and Block-Repeat Registers (RS, RE) 

The 32-bit repeat start address register (RS) contains the starting address of 
the block of program memory to be repeated when the CPU is operating in the 
repeat mode. 

The 32-bit repeat end address register (RE) contains the ending address of 
the block of program memory to be repeated when the CPU is operating in the 
repeat mode. 

i 1 

Note: RE < RS 

If RE < RS, the block of program memory will not be repeated, and the code 

will not loop backwards. However, the ST(RM) bit remains set to 1 . 
i i 

The repeat-count register (RC) is a 32-bit register used to specify the number 
of times a block of code is to be repeated when a block repeat is performed. 
If RC contains the number n, the loop is executed n + 1 times. 

3.1.12 Program Counter (PC) 

The PC is a 32-bit register containing the address of the next instruction to be 
fetched. While the program counter register is not part of the CPU register file, 
it can be modified by instructions that modify the program flow. 
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3.1.13 Reserved Bits and Compatibility 

To retain compatibility with future members of the TMS320C3x family of micro- 
processors, reserved bits that are read as 0 must be written as 0. A reserved 
bit that has an undefined value must not have its current value modified. In oth- 
er cases, you should maintain the reserved bits as specified. 
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3.2 Memory 

The TMS320C3x's total memory space of 1 6M (million) 32-bit words contains 
program, data, and I/O space, allowing tables, coefficients, program code, or 
data to be stored in either RAM or ROM. In this way, you can maximize memory 
usage and allocate memory space as desired. 

RAM blocks 0 and 1 are each 1 K x 32 bits. The ROM block is 4K x 32 bits. Each 
on-chip RAM and ROM block is capable of supporting two CPU accesses in 
a single cycle. The separate program buses, data buses, and DMA buses al- 
low for parallel program fetches, data reads/writes, and DMA operations. 
Chapter 9 covers this in detail. 

3.2.1 TMS320C3x Memory Maps 

The memory map depends on whether the processor is running in micropro- 
cessor mode (MC/MP or MCBL/MP = 0) or microcomputer mode (MC/MP or 
MCBUMP = 1). The memory maps for these modes are similar (see 
Figure 3-7). Locations 800000h through 801 FFFh are mapped to the expan- 
sion bus . When this region, available only on the TMS320C30, is accessed, 
MSTRB is active. Locations 802000h through 803FFFh are reserved. Loca- 
tions 804000h through 805FFFh are mapped to the expansion bus. When this 
region, available only on the TMS320C30, is accessed, IOSTRB is active. Lo- 
cations 806000h through 807FFFh are reserved. All of the memory-mapped 
peripheral registers are in locations 808000h through 8097FFh. In both 
modes, RAM block 0 is located at addresses 809800h through 809BFFh, and 
RAM block 1 is located at addresses 809C00h through 809FFFh. Memory lo- 
cations 80A000h through OFFFFFFh are accessed over the primary external 
memory port (STRB active). 

In microprocessor mode, the 4K on-chip ROM (TMS320C30) or boot loader 
(TMS320C31) is not mapped into the TMS320C3x memory map. As shown 
in Figure 3-7, locations Oh through 03Fh consist of interrupt vector, trap vec- 
tor, and reserv ed loca tions, all of which are accessed over the primary external 
memory port (STRB active). Interrupt and trap vector locations are shown in 
Figure 3-9. Locations 040h-7FFFFFh and 80A0OOL-FFFFFFh are also ac- 
cessed over the primary external memory port. 
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In microcomputer mode, the 4K on-chip ROM (TMS320C30) or boot loader 
(TMS320C31) is mapped into locations Oh through OFFFh. There are 192 lo- 
cations (Oh through BFh) within this block for interrupt vectors, trap vectors, 
and a reserved space. Locat ions 10 00h-7FFFFFh are accessed over the pri- 
mary external memory port (STRB active). 



Reserved Spaces 

Do not read and write to reserved portions of the TMS320C3x 
memory apace and reserved peripheral bus addraasaa. Doing so 
might eauaa the TMS320C3x to halt operation and require a system 
reset to restart, 
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Figure 3-7. TMS320C30 Memory Maps 



Oh 



03Fh 
040h 



7FFFFFh 
800000h 



801 FFFh 
802000h 



803FFFh 
804000h 



805FFFh 
806000h 



807FFFh 
808000h 



8097FFh 
809800h 



809BFFh 
809C00h 



809FFFh 
80A000h 



OFFFFFFh 



Reset, Interrupt, Trap Vector, 
and Reserv ed Loca tions (64) 
External STRB Active 



External 
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Expansio n Bus 
MSTRB Active 
(8K Words) 



Reserved 
(8K Words) 



Expansio n Bus 
IOSTRB Active 
(8K Words) 



Reserved 
(8K Words) 



Peripheral Bus 
Memory-Mapped 

Registers 
(6K Words Internal) 



RAM Block 0 
(1K Word Internal) 



RAM Block 1 
(1K Word Internal) 



External 



STRB Active 



Oh 



OBFh 
OCOh 



OFFFh 
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7FFFFFh 
800000h 



801 FFFh 
802000h 



803FFFh 
804000h 



805FFFh 
806000h 



807FFFh 
808000h 



8097FFh 
809800h 



809BFFh 
809C00h 



809FFFh 
80A000h 



OFFFFFFh 



Reset, Interrupt, Trap Vector, 
and Reserved Locations (192) 



ROM 
(Internal) 



External 



STRB Active 



Expansio n Bus 
MSTRB Active 
(8K Words) 



Reserved 
(8K Words) 



Expansio n Bus 
IOSTRB Active 
(8K Words) 



Reserved 
(8K Words) 



Peripheral Bus 
Memory-Mapped 

Registers 
(6K Words Internal) 



RAM Block 0 
(1K Word Internal) 



RAM Block 1 
(1K Word Internal) 



Exte rnal 
STRB Active 



(a) Microprocessor Mode 



(b) Microcomputer Mode 
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Figure 3-8. TMS320C31 Memory Maps 
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040h 
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80A000h 



FFFFFFh 
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(External STRB Active) 
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Reserved for Boot 
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(See Section 3.4.) 
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Peripheral Bus 
Memory-Mapped 
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(b) Microcomputer/Boot Loader Mode 



Boot 1-3 locations are used by the boot-loader function. See Section 3.4 for 
a complete description. All reserved memory locations are described in 
Table 2-5 on page 2-31 . 
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3.2.2 TMS320C31 Memory Maps 

Setting the TMS320C31 MCBL/MP pin determines the mode in which the 
TMS320C31 can function: 

□ Microprocessor mode (MCBL/MP = 0), or 

□ Microcomputer/boot loader mode (MCBUMP = 1) 

The major difference between these two modes is their memory maps (see 
Figure 3-8) . The program boot load feature is enabled when the MCBL/MP pin 
is driven high during reset. 

Figure 3-8 shows the memory locations (internal and external) used by the 
boot loader to load the source program. 

3.2.3 Reset/Interrupt/Trap Vector Map 

The addresses for the reset, interrupt, and trap vectors are 00h-3Fh, as shown 
in Figure 3-9. The reset vector contains the address of the reset routine. 

Microprocessor and Microcomputer Modes 

In the microprocessor mode of the TMS320C30 and TMS320C31 and the 
microcomputer mode of the TMS320C30, the interrupt and trap vectors stored 
in locations 0h-3Fh are the addresses of the starts of the respective interrupt 
and trap routines. For example, at reset, the content of memory location OOh 
(reset vector) is loaded into the PC, and execution begins from that address. 
See Figure 3-9. 

Microcomputer/Boot Loader Mode 

In the microcomputer/boot loader mode of the TMS320C31 , the interrupt and 
trap vectors stored in locations 809FC1 h-809FFFh are branch instructions to 
the start of the respective interrupt and trap routines. See Figure 3-10. 
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Figure 3-9. Reset, Interrupt, and Trap-Vector Locations for the TMS320C30/TMS320C31 
Microprocessor Mode 



OOh 
01 h 
02h 
03h 
04h 
05h 
06h 
07h 
08h 
09h 
OAh 
OBh 

OCh 
1Fh 

20h 



3Bh 
3Ch 
3Dh 
3Eh 
3Fh 



RESET 
INTO 
INT1 
INT2 
INT3 
XINTO 
RINTO 
XINTlt 
RINTlt 
TINTO 
TINT1 
DINT 



RESERVED 



TRAPO 



TRAP 27 



TRAP 28 (Reserved) 



TRAP 29 (Reserved) 



TRAP 30 (Reserved) 



TRAP 31 (Reserved) 



t Reserved on TMS320C31 



Note: Traps 28-31 

Traps 28-31 are reserved; do not use them. 
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Figure 3-10. Interrupt and Trap Branch Instructions for the TMS320C31 Microcomputer 
Mode 



809FC1h 
809FC2h 
809FC3h 
809FC4h 
809FC5h 
809FC6h 
809FC7h 
809FC8h 
809FC9h 
809FCAh 
809FCBh 



809FCC- 
809FDFh 



809FE0h 
809FE1h 



809FFBh 
809FFCh 
809FFDh 
809FFEh 
809FFFh 



INTO 



INT1 



INT2 



INT3 



XINTO 



RINTO 



XINT1 



RINT1 



TINTO 



TINT1 



DINT 



RESERVED 



TRAPO 



TRAP1 



TRAP27 



TRAP28 (Reserved) 



TRAP29 (Reserved) 



TRAP30 (Reserved) 



TRAP31 (Reserved) 



Note: Traps 28-31 

Traps 28-31 are reserved; do not use them. 
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3.2.4 Peripheral Bus Map 

The memory-mapped peripheral registers are located starting at address 
808000h. The peripheral bus memory map is shown in Figure 3-1 1 . Each pe- 
ripheral occupies a 16-word region of the memory map. Locations 80801 Oh 
through 80801 Fh and locations 808070h through 8097FFh are reserved. 

Figure 3-1 1. Peripheral Bus Memory Map 



808000h 


DMA Controller Registers 


80800Fh 


(16) 


80801 Oh 


Reserved 


80801 Fh 


(16) 


808020h 


Timer 0 Registers 


80802Fh 


(16) 


808030h 


Timer 1 Registers 


80803Fh 


(16) 


808040h 


Serial-Port 0 Registers 


80804Fh 


(16) 


808050h 


Serial-Port 1 Registerst 


80805Fh 


(16) 


808060h 


Primary and Expansion Port 


80806Fh 


Registers (16) 


808070h 


Reserved 


8097FFh 





t Reserved on TMS320C31 
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3.3 Instruction Cache 

A 64 x 32-bit instruction cache facilitates maximum system performance by 
storing sections of code that can be fetched when the device repeatedly ac- 
cesses time-critical code. This reduces the number of off-chip accesses nec- 
essary and allows code to be stored off-chip in slower, lower-cost memories. 
The cache also frees external buses from program fetches so that they can be 
used by the DMA or other system elements. 

The cache can operate automatically, with no user intervention. Subsection 
3.3.2 describes a form of the least recently used (LRU) cache update algo- 
rithm. 

3.3.1 Cache Architecture 

The instruction cache (see Figure 3-12) contains 64 32-bit words of RAM; it 
is divided into two 32-word segments. Associated with each segment is a 
1 9-bit segment start address (SSA) register. For each word in the cache, there 
is a corresponding single bit: present (P) flag. 
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Figure 3-12. Instruction Cache Architecture 
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Segment Word 0 



Segment Word 1 



Segment Word 30 



Segment Word 31 



32 



BC 



J> Segment 0 
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Segment Number 
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Segment Number 



SSA Register 1 



30 



Segment Word 0 



Segment Word 1 



Segment Word 30 



Segment Word 31 



> Segment 1 



When the CPU requests an instruction word from external memory, the cache 
algorithm checks to determine whether the word is already contained in the 
instruction cache. Figure 3-1 3 shows the partitioning of an instruction address 
as used by the cache control algorithm. The algorithm uses thel 9 most signifi- 
cant bits (MSBs) of the instruction address to select the segment; the five least 
significant bits (LSBs) define the address of the instruction word within the per- 
tinent segment. The algorithm compares the 19 MSBs of the instruction ad- 
dress with the two SSA registers. If there is a match, the algorithm checks the 
relevant P flag. The P flag indicates whether a word within a particular segment 
is already present in cache memory. 



Figure 3-13. Address Partitioning for Cache Control Algorithm 



23 



54 



segment start address 
(SSA) 



instruction word 
address within segment 



If there is no match, one of the segments must be replaced by the new data. 
The segment replaced in this circumstance is determined by the LRU algo- 
rithm. The LRU stack (see Figure 3-12) is maintained for this purpose. 
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The LRU stack determines which of the two segments qualifies as the least 
recently used after each access to the cache; therefore, the stack contains ei- 
ther 0,1 or 1 ,0. Each time a segment is accessed, its segment number is re- 
moved from the LRU stack and pushed onto the top of the LRU stack. There- 
fore, the number at the top of the stack is the most recently used segment num- 
ber, and the number at the bottom of the stack is the least recently used seg- 
ment number. 

At system reset, the LRU stack is initialized with 0 at the top and 1 at the bot- 
tom. All P flags in the instruction cache are cleared. 

When a replacement is necessary, the least recently used segment is selected 
for replacement. Also, the 32 P flags for the segment to be replaced are set 
to 0, and the segment's SSA register is replaced with the 1 9 MSBs of the in- 
struction address. 



3.3.2 Cache Algorithm 

When the TMS320C3x requests an instruction word from external memory, 
one of two possible actions occurs: a cache hit or a cache miss. 

□ Cache Hit. The cache contains the requested instruction, and the follow- 
ing actions occur: 

1) The instruction word is read from the cache. 

2) The number of the segment containing the word is removed from the 
LRU stack and pushed to the top of the LRU stack, thus moving the 
other segment number to the bottom of the stack. 

□ Cache Miss. The cache does not contain the instruction. Following are 
the types of cache miss: 

■ Word miss. The segment address register matches the instruction ad- 
dress, but the relevant P flag is not set. The following actions occur in 
parallel: 

■ The instruction word is read from memory and copied into the 
cache. 

■ The number of the segment containing the word is removed from 
the LRU stack and pushed to the top of the LRU stack, thus mov- 
ing the other segment number to the bottom of the stack. 

■ The relevant P flag is set. 
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■ Segment miss. Neither of the segment addresses matches the in- 
struction address. The following actions occur in parallel: 

■ The least recently used segment is selected for replacement- The 
P flags for all 32 words are cleared. 

■ The SSA register for the selected segment is loaded with the 1 9 
MSBs of the address of the requested instruction word. 

■ The instruction word is fetched and copied into the cache. It goes 
into the appropriate word of the least recently used segment. The 
P flag for that word is set to 1 . 

■ The number of the segment containing the instruction word is re- 
moved from the LRU stack and pushed to the top of the LRU 
stack, thus moving the other segment number to the bottom of the 
stack. 

Only instructions may be fetched from the program cache. All reads and writes 
of data in memory bypass the cache. Program fetches from internal memory 
do not modify the cache and do not generate cache hits or misses. The pro- 
gram cache is a single-access memory block. Dummy program fetches (i.e., 
following a branch) are treated by the cache as valid program fetches and can 
generate cache misses and cache updates. 

Take care when using self-modifying code. If an instruction resides in cache 
and the corresponding location in primary memory is modified, the copy of the 
instruction in cache is not modified. 

You can use the cache more efficiently by aligning program code on 32-word 
address boundaries. Do this with the ALIGN directive when coding assembly 
language. 

3.3.3 Cache Control Bits 

Three cache control bits are located in the CPU status register: 

□ Cache Clear Bit (CC). Writing a 1 to the cache clear bit (CC) invalidates 
all entries in the cache. All P flags in the cache are cleared. The CC bit is 
always cleared after the cache is cleared. It is therefore always read as a 
0. At reset, the cache is cleared and 0 is written to this bit. 

□ Cache Enable Bit (CE). Writing a 1 to this bit enables the cache. When 
enabled, the cache is used according to the previously described cache 
algorithm. Writing a 0 to the cache enable bit disables the cache; no up- 
dates or modification of the cache can be performed. Specifically, no SSA 
register updates are performed, no P flags are modified (unless CC = 1), 
and the LRU stack is not modified. Writing a 1 to CC when the cache is 
disabled clears the cache, and, thus, the P flags. No fetches are made 
from the cache when the cache is disabled. At reset, 0 is written to this bit. 
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□ Cache Freeze Bit (CF). When CF = 1 , the cache is frozen. If, in addition, 
the cache is enabled, fetches from the cache are allowed, but no modifica- 
tion of the state of the cache is performed. Specifically, no SSA register 
updates are performed, no P flags are modified (unless CC = 1), and the 
LRU stack is not modified. You can use this function to keep frequently 
used code resident in the cache. Writing a 1 to CC when the cache is fro- 
zen clears the cache, and, thus, the P flags. At reset, 0 is written to this bit. 

Table 3-6 defines the effect of the CE and CF bits used in combination. 
Table 3-6. Combined Effect of the CE and CF Bits 



CE 


CF 


Effect 


0 


0 


Cache not enabled 


0 


1 


Cache not enabled 


1 


0 


Cache enabled and not frozen 


1 


1 


Cache enabled and frozen 
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3.4 Using the TMS320C31 Boot Loader 

This section describes how to use the TMS320C31 microcomputer/boot load- 
er (MCBL/MP)function. This feature is unique to the TMS320C31 and is not 
available on the TMS320C30 devices. The source code for the boot loader is 
supplied in Appendix Q. 



3.4.1 Boot-Loader Operations 

The boot loader lets you load and execute programs that are received from a 
host processor, inexpensive EPROMs, or other standard memory devices. 
The programs to be loaded either reside in one of three memory mapped areas 
identified as Boot 1 , Boot 2, and Boot 3 (see the shaded areas of Figure 3-8), 
or they are received by means of the serial port. 

User-definable byte, half-word, and word-data formats, as well as 32-bit fixed 
burst loads from the TMS320C31 serial port, are supported. See Section 8.2 
on page 8-13 for a detailed description of the serial-port operation. 



3.4.2 Invoking the Boot Loader 

The boot-loader function is selected by resetting the p rocessor while driving 
the MCBL/MP pin high. Use interrupt pins INT3 - INTO to set the mode of the 
boot load operation. Figure 3-14 shows the flow of this operation, which de- 
pends on the mode selected (external memory or serial boot). Figure 3-15 
shows memory load operations; Figure 3-16 shows serial port load opera- 
tions. 
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Figure 3-14. Boot-Loader-Mode Selection Flowchart 
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Figure 3-15. Boot-Loader Memory-Load Flowchart 
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Figure 3-16. Boot-Loader Serial-Port Load-Mode Flowchart 
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I t ■ 
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Block Loaded 

r 

Begin Program Execution 



3.4.3 Mode Selection 

After reset, the loader mode is determined by polling the status of the 
INT3-INT0 bits of the IF register. The bits are polled in the order described in 
the flowchart in Figure 3-14 on page 3-27. Table 3-7 lists the mode options 
and the interrupt that you c an use to set the particular mode. The interrupt can 
be driven any time after the RESET pin has been deasserted. Unless only one 
interrupt flag bit is set (INTO, INT1 , INT2, or INT3), the boot mode cannot be 
guaranteed. 
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Table 3-7. Loader Mode Selection 



Active Interrupt 


Loader Mode 


Memory Addressee 


INTO 


External memory 


Boot 1 address 0x001 OCX) 


INT1 


External memory 


Boot 2 address 0x400000 


INT2 


External memory 


Boot 3 address OxFFFOOO 


INT3 


32-bit serial 


Serial port 0 



3.4.4 External Memory Loading 

Table 3-6 shows and describes the information that you must specify to define 
boot memory organization (8, 1 6, or 32 bits), the code block size, the load des- 
tination address, and memory access timing control for the boot memory. You 
must specify this information before a source program can be externally 
loaded. 

This information must be specified in the first four locations of the Boot 1 , Boot 
2, or Boot 3 areas. The header is followed by the data or program code that 
is the block size in length. 



Table 3-8. External Memory Loader Header 



Location 


Description 


Valid Data Entries 


0 


Boot memory type (8, 1 6, or 32) 


0x8, 0x10, or 0x20 specified as a 32-bit number 


1 


Boot memory configuration 
(defined # of wait states, etc.) 


See Chapter 7 for valid bus-control register entries. 


2 


Program block size (blk) 


Any value 0 < blk < 224 


3 


Destination address 


Any valid TMS320C31 24-bit address 


4 


Program code starts here 


Any 32-bit data value or valid TMS320C3x instruction 



The loader fetches 32 bits of data for each specified location, regardless of 
what memory configuration width is specified. The data values must reside 
within or be written to memory, beginning with the value of least significance 
for each 32 bits of information. 



3.4.5 Examples of External Memory Loads 

Example 3-1 , Example 3-2, and Example 3-3 show memory images for 
byte-wide, 16-bit-wide, and 32-bit-wide configured memory. 
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These examples assume the following: 

□ An INTO signal was detected after reset was deasserted (signifying an ex- 
ternal memory load from Boot 1). 

□ The loader header resides at memory location 0x1 000 and defines the fol- 
lowing: 

■ Boot memory type EPROMs that require two wait states and SWW =11, 

■ A loader destination address at the beginning of the TMS320C31 's in- 
ternal RAM Block 1 , and 

■ A single block of memory that is 0x1 FF in length. 
Example 3-1. Byte- Wide Configured Memory 



Address 


Value 


Comments 


0x1000 


0x08 


Memory width = 8 bits 


0x1001 


0x00 




0x1002 


0x00 




0x1003 


0x00 




0x1004 


0x58 


Memory type = SWW = 11, WCNT = 2 


0x1005 


0x10 




0x1006 


0x00 




0x1007 


0x00 




0x1008 


OxFF 


Program code size = 0x1 FF 


0x1009 


0x01 




0x1 00A 


0x00 




0x1 00B 


0x00 




0x1 00C 


0x00 


Program load starting address = 0x809C00 


0x1 00D 


0x9C 




0x1 00E 


0x80 




0x1 OOF 


0x00 
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Example 3-2. 16- Bit-Wide Configured Memory 



Address 


Value 


Comments 


r\.,<4 AAA 

OXl 000 


Aw 4 A 

0x10 


Memory width = 1 o 


0x1001 


0x0000 




Aw A AAO 
0X1002 


Aw4 ACO 

0X1 058 


lilAmAM> Km* C\AAA/ A A IA/^MT O 

Memory type = oww = n , wont = 2 


n v i aaq 
UXl UUo 


uxuuuu 




0x1004 


0x1 FF 


Program code size = 0x1 FF 


0x1005 


0x0000 




0x1006 


0x9C00 


Program load starting address = 0x809000 


0x1007 


0x0080 





Example 3-3.32-Bit-Wide Configured Memory 



Address 


Value 


Comments 


0x1000 


0x00000020 


Memory width = 32 


0x1001 


0x00001058 


Memory type = SWW = 11, WONT = 2 


0x1002 


0x000001 FF 


Program code size = 0x1 FF 


0x1003 


0x00809000 


Program load starting address = 0x809000 



After reading the header, the loader transfers blk, 32-bit words beginning at a 
specified destination address. Code blocks require the same byte and half- 
word ordering conventions. The loader can also load multiple code blocks at 
different address destinations. 

After loading all code blocks, the boot loader branches to the destination ad- 
dress of the first block loaded and begins program execution. Consequently, 
the first code block loaded should be a start-up routine to access the other 
loaded programs. 

Each code block has the following header: 

BLK size 1st location 

Destination address 2nd location 

End the loader function and begin execution of the first code block by append- 
ing the value of 0x00000000 to the last block. 
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It is assumed that at least one block of code will be loaded when the 
loader Is invoked. Initial loader Invocation with e block size of 
0x00000000 produces unpredictable results. 



3.4.6 Serial-Port Loading 

Boot loads, by way of the TMS320C31 serial port, are selected by driving the 
INT3 pin active (low) following reset. The loader automatically configures the 
serial port for 32-bit fixed-burst-mode reads. It is interrupt-driven by the frame 
synchronization receive (FSR) signal. You cannot change this mode for boot 
loads. Your hardware must externally generate the serial-port clock and FSR. 

As in parallel loading, a header must precede the actual program to be loaded. 
However, you need only apply the block size and destination address because 
the loader and your hardware have predefined serial-port speed and data for- 
mat (i.e., skip data words 0 and 1 from Table 3-8). 

The transferred data-bit order must begin with the MSB and end with the LSB. 



3.4.7 Interrupt and Trap-Vector Mapping 

Unlike the microprocessor mode, the microcomputer/boot-loader (MCBL) 
mode uses a dual-vectoring scheme to service interrupt and trap requests. 
Dual vectoring was implemented to ensure code compatibility with future ver- 
sions of TMS320C3x devices. 

In a dual-vectoring scheme, branch instructions to an address, rather than di- 
rect-interrupt vectoring, are used. The normal interrupt and trap vectors are 
def ined to vector to the last 63 locations in the on-chip RAM , starting at address 
809FC1 h. When the loader is invoked, the last 63 locations in RAM Block 1 of 
the TMS320C31 are assumed to contain branch instructions to the interrupt 
source routines. 



Take care to ensure that these locations are not inadvertently 
overwritten by loaded program or data values, 
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Table 3-9 shows the MCBL/MP mode interrupt and trap instruction memory 
maps. 

Table 3-9. TMS320C31 Interrupt and Trap Memory Maps 



Address 


Description 


809FC1 


INTO 




iMTi 
■ i^i i i 


809FC3 


INT2 


809FC4 


INT3 


80QFC5 




809FC6 


RINTO 


809FC7 


noooi vou 


809FC8 


Reserved 


809FC9 


TINTO 


809FCA 


TINT1 


809FCB 


DINTO 


809FCC-809FDF 


Reserved 


809FE0 


TRAPO 


809FE1 

• 
• 


TRAP1 

• 

• 


• 

809FFB 


• 

TRAP27 


809FFC-809FFF 


Reserved 
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3.4.8 Precautions 

The boot loader builds a one-word-deep stack, starting at location 809801 h. 



Avoid loading code at location 809801 h. 



The interrupt flags are not reset by the boot-loader function. If pending inter- 
rupts are to be avoided when interrupts are enabled, clear the IF register be- 
fore enabling interrupts. 

The MCBU/MP pin should remain high during the entire boot-loader execution, 
but it can be changed subsequently at any time. The TMS320C31 does not 
need to be reset after the MCBL/MP pin is changed. During the change, the 
TMS320C31 should not access addresses Oh-FFFh. 
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Data Formats and Floating-Point Operation 



In the TMS320C3x architecture, data is organized into three fundamental 
types: integer, unsigned-integer, and floating-point. The terms integer and 
signed-integer are considered to be equivalent. The TMS320C3x supports 
short and single-precision formats for signed and unsigned integers. It also 
supports short, single-precision, and extended-precision formats for float- 
ing-point data. 

Floating-point operations make fast, trouble-free, accurate, and precise com- 
putations. Specifically, the TMS320C3x implementation of floating-point arith- 
metic facilitates floating-point operations at integer speeds while preventing 
problems with overflow, operand alignment, and other burdensome tasks 
common in integer operations. 

This chapter discusses in detail the data formats and floating-point operations 
supported in the TMS320C3x. Major topics in this section are as follows: 



Topic Page 

4.1 Integer Formats ........ 4-2 

4.2 Unsigned-Integer Formats 4-3 

4.3 Floating-Point Formats . . . . . .... 4-4 

4.4 Floating-Point Multiplication — ... 4-10 

4.5 Floating-Point Addition and Subtraction 4-14 

4.6 Normalization Using the NORM Instruction 4-18 

4.7 Rounding: The RND Instruction . . . — ... 4-20 

4.8 Floating-Point-to-lnteger Conversion 4-22 

4.9 Integer-to-Floating-Point Conversion 4-24 
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4.1 Integer Formats 

The TMS320C3x supports two integer formats: a 16-bit short integer format 
and a 32-bit single-precision integer format. When extended-precision regis- 
ters are used as integer operands, only bits 31-0 are used; bits 39-32 remain 
unchanged and unused. 



4.1 .1 Short-Integer Format 

The short integer format is a 1 6-bit two's complement integer format for imme- 
diate integer operands. For those instructions that assume integer operands, 
this format is sign-extended to 32 bits (see Figure 4-1). The range of an 
integer si, represented in the short integer format, is -2 15 * si * 2 15 - 1 . In 
Figure 4-1 , s = signed bit. 

Figure 4-1. Short Integer Format and Sign Extension of Short Integers 

15 0 

s 

(a) Short Integer Format 

31 16 15 0 

ssssssssssssssss 

(b) Sign Extension of a Short Integer 



4.1 .2 Single-Precision Integer Format 

In the single-precision integer format, the integer is represented in two's com- 
plement notation. The range of an integer sp, represented in the single-preci- 
sion integer format, is -2 31 £ sp * 2 31 - 1 . Figure 4-2 shows the single-preci- 
sion integer format. 

Figure 4-2. Single-Precision Integer Format 
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4.2 Unsigned-Integer Formats 



The TMS320C3x supports two unsigned-integer formats: a 16-bit short format 
and a 32-bit single-precision format. In extended-precision registers, the un- 
signed-integer operands use only bits 31-0; bits 39-32 remain unchanged. 



4.2.1 Short Unsigned-Integer Format 



Figure 4-3 shows thel 6-bit, short, unsigned-integer format for immediate un- 
signed-integer operands. For those instructions that assume 
unsigned-integer operands, this format is zero-filled to 32 bits. In Figure 4-3, 
x = most significant bit (MSB) (1 or 0). 



Figure 4-3. Short Unsigned-Integer Format and Zero Fill 



15 



(a) Short Unsigned-Integer Format 
31 16 15 0 



0000000000000000 x 



(b) Zero Fill of a Short Unsigned Integer 



4.2.2 Single-Precision Unsigned-Integer Format 



In the single-precision unsigned-integer format, the number is represented < 
a 32-bit value, as shown in Figure 4-4. 



Figure 4-4. Single-Precision Unsigned-Integer Format 
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4.3 Floating-Point Formats 

All TMS320C3x floating-point formats consist of three fields: an exponent field 
(e) , a single-bit sign field (s) , and a fraction field (f ) . These are stored as shown 
in Figure 4-5. The exponent field is a two's complement number. The sign field 
and fraction field may be considered one unit and referred to as the mantissa 
field (man). The two's complement fraction is combined with the sign bit and 
the implied most significant bit to create the mantissa. The mantissa repre- 
sents a normalized two's complement number. A normalized representation 
implies a most significant nonsign bit/thus providing additional precision. The 
value of a floating-point number x as a function of the fields e, s, and f is given as 

x = 01 .f x 2 e if s = 0, or if the leading 0 is the sign bit and the 
1 is the implied most significant nonsign bit 

1 0.f x 2 e if s = 1 , or if the leading 1 is the sign bit and the 
0 is the implied most significant nonsign bit 

0 if e = most negative two's complement 

value of the specified exponent field width 



Figure 4-5. Generic Floating-Point Format 



man (mantissa) ■ 



Note: e = exponent field 

s = single-bit sign field 
f = fraction field 



Three floating-point formats are supported on the TMS320C3x. The first is a 
short floating-point format for immediate floating-point operands, consisting of 
a 4-bit exponent, a sign bit, and an 1 1 -bit fraction. The second is a single-preci- 
sion format consisting of an 8-bit exponent, a sign bit, and a 23-bit fraction. The 
third is an extended-precision format consisting of an 8-bit exponent, a sign 
bit, and a 31 -bit fraction. 



4.3.1 Short Floating-Point Format 

In the short floating-point format, floating-point numbers are represented by 
a two's complement 4-bit exponent field (e) and a two's complement 12-bit 
mantissa field (man) with an implied most significant nonsign bit. See 
Figure +-6. 
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Figure 4-6. Short Floating-Point Format 



15 12 



11 10 



o 



e 



8 



mantissa 



Operations are performed with an implied binary point between bits 1 1 and 1 0. 
When the implied most significant nonsign bit is made explicit, it is located to 
the immediate left of the binary point. The floating-point two's complement 
number x in the short floating-point format is given by the following: 

x = 01.f x2© if s = 0 
10.f x2 e if s = 1 

0 if e = -8 

You must use the following reserved values to represent 0 in the short float- 
ing-point format; 



The following examples illustrate the range and precision of the short float- 
ing-point format: 



e = -8 



s = 0 



f = 0 



Most Positive: 
Least Positive: 
Least Negative: 
Most Negative: 



x = (2 - 2~ 11 ) x2 7 = 2.5594 x 1 0* 
x= 1 x 2~ 7 = 7.8125 x 10-3 
x = (-1-2-11) x 2-7 = -7.8163 x 10-3 
X = -2 x 2 7 = -2.5600 x 10 2 
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4.3.2 Single-Precision Floating-Point Format 

In the single-precision format, the floating-point number is represented by an 
8-bit exponent field (e) and a two's complement 24-bit mantissa field (man) 
with an implied most significant nonsign bit. See Figure 4-7. 

Figure 4-7. Single-Precision Floating-Point Format 



31 24 


23 | 22 




0 


e 


s 


f 




i. — 


— mantissa 





Operations are performed with an implied binary point between bits 23 and 22. 
When the implied most significant nonsign bit is made explicit, it is located to 
the immediate left of the binary point. The floating-point number x is given by 
the following: 

x = 01 .f x 2 e if s = 0 
10.f x2© if s = 1 

0 if e = -8 

You must use the following reserved values to represent 0 in the single-preci- 
sion floating-point format: 

e = -128 

s = 0 

f = 0 

The following examples illustrate the range and precision of the single-preci- 
sion floating-point format. 

Most Positive: x = (2 - 2~23) x 21 27 = 3.4028234 x 1 038 
Least Positive: x = 1 x 2~ 1 27 = 5.877471 7 x 1 0~39 
Least Negative: x = (-1 -2-23) x 2-1 27 = _ 5.8774724 x 1 0-39 
Most Negative: x = -2 x 2 127 = - 3.4028236 x 1 0 3 8 

4.3.3 Extended-Precision Floating-Point Format 

In the extended-precision format, the floating-point number is represented by 
an 8-bit exponent field (e) and a 32-bit mantissa field (man) with an implied 
most significant nonsign bit. See Figure 4-8. 



4-6 



Floating-Point Formats 



Figure 4-8. Extended-Precision Floating-Point Format 



39 32 I 31 I 30 



mantissa ■ 



Operations are performed with an implied binary point between bits 31 and 30. 
When the implied most significant nonsign bit is made explicit, it is located to 
the immediate left of the binary point. The floating-point number x is given by 
the following: 

x = 01 .f x2© if s = 0 

10.fx2© ifs = 1 

0 if e = -128 

You must use the following reserved values to represent 0 in the extended-pre- 
cision floating-point format: 

e = -128 

s = 0 

f = 0 

The following examples illustrate the range and precision of the extended-pre- 
cision floating-point format: 

Most Positive: x = (2 - 2"23) x 2^ 27 = 3.4028234 x 1 038 
Least Positive: x = 1 x 2-127 = 5.8774717541 x 1038 
Least Negative: x = (-1-2-31) x 2-127 = -5.8774717569 x 10-39 
Most Negative: x = -2x2l27 = _ 3.4028236691 x 1 038 
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4.3.4 Conversion Between Floating-Point Formats 

Floating-point operations assume several different formats for inputs and out- 
puts. These formats often require conversion from one floating-point format to 
another (e.g., short floating-point format to extended-precision floating-point 
format). Format conversions occur automatically in hardware, with no over- 
head, as a part of the floating-point operations. Examples of the four conver- 
sions are shown in Figure 4-9, Figure 4-10, Figure 4-11, and Figure 4-12. 
When a floating-point format 0 is converted to a greater-precision format, it is 
always converted to a valid representation of 0 in that format. In Figure 4-9, 
Figure 4-10, Figure 4-11, and Figure 4-12, s = sign bit of the exponent. 

Figure 4-9. Converting From Short Floating-Point Format to Single-Precision 
Floating-Point Format 
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(a) Short Floating-Point Format 




31 27 24 


23 


22 






12 


11 


0 


ssssxxxx 


y 


y 






y 


0 


0 



(b) Single-Precision Floating-Point Format 



In this format, the exponent field is sign-extended, and the fraction field is filled 
with 0s. 

Figure 4-10. Converting From Short Floating-Point Format to Extended-Precision 
Floating-Point Format 



39 



35 32 31 30 



15 



12 11 10 



s x x x y y 



(a) Short Floating-Point Format 
20 19 



ssssxxxx y y 



y 0 



(b) Extended-Precision Floating-Point Format 



The exponent field in this format is sign-extended, and the fraction field is filled 
with 0s. 



4-8 



Floating-Point Formats 



Figure 4-11. Converting From Single-Precision Floating-Point Format to 
Extended-Precision Floating-Point Format 



31 



24 23 22 



39 



32 31 30 



(a) Single-Precision Floating-Point Format 
8 7 



(b) Extended-Precision Floating-Point Format 

The fraction field is filled with 0s. 

Figure 4-12. Converting From Extended-Precision Floating-Point Format to 
Single-Precision Floating-Point Format 



39 



32 31 30 



8 7 



x y 



y z 



(a) Extended-Precision Floating-Point Format 
31 24 23 22 



x y y 



(b) Single-Precision Floating-Point Format 



The fraction field is truncated. 
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4.4 Floating-Point Multiplication 

A floating-point number a can be written in floating-point format as in the fol- 
lowing formula: 

a = a(man) x 2«( ex P) 

where: 

a(man) is the mantissa and a(exp) is the exponent. 

The product of a and b is c, defined as: 

c = a x b = a(man) x b(man) x 2(<*(©xP) + b (exp)) 

where: 

c(man) = a(man) x b(man), and 
c(exp) = a(exp) + b(exp) 

During floating-point multiplication, source operands are always assumed to 
be in the single-precision floating-point format. If the source of the operands 
is in short floating-point format, it is extended to the single-precision float- 
ing-point format. If the source of the operands is in extended-precision float- 
ing-point format, it is truncated to single-precision format. These conversions 
occur automatically in hardware with no overhead. All results of floating-point 
multiplications are in the extended-precision format. These multiplications oc- 
cur in a single cycle. 

A flowchart for floating-point multiplication is shown in Figure 4-1 3. In step 1 , 
the 24-bit source operand mantissas are multiplied, producing a 50-bit result 
c(man). (Note that input and output data are always represented as normal- 
ized numbers.) In step 2, the exponents are added, yielding c(exp). Steps 3 
through 6 check for special cases. Step 3 checks for whether c(man) in exten- 
ded-precision format is equal to 0. If c(man) is 0, step 7 sets c(exp) to -128, 
thus yielding the representation for 0. 

Steps 4 and 5 normalize the result. If a right shift of 1 is necessary, then in step 
8, c(man) is right-shifted 1 bit, thus adding 1 to c(exp). If a right shift of 2 is nec- 
essary, then in step 9, c(man) is right-shifted 2 bits, thus adding 2 to c(exp). 
Step 6 occurs when the result is normalized. 

In step 1 0, c(man) is set in the extended-precision floating-point format. Steps 
1 1 through 1 6 check for special cases of c(exp). If c(exp) has overflowed (step 
1 1 ) in the positive direction, then step 1 4 sets c(exp) to the most positive exten- 
ded-precision format value. If c(exp) has overflowed in the negative direction, 
then step 1 4 sets c(exp) to the most negative extended-precision format value. 
If c(exp) has underf lowed (step 12), then step 15 sets c to 0; that is, c(man) 
= 0 and c(exp) =-128. 



4-10 



Floating-Point Multiplication 



Figure 4-13. Flowchart for Floating-Point Multiplication 
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Example 4-1 , Example 4-2, Example 4-3, Example 4-4, and Example 4-5 
illustrate how floating-point multiplication is performed on the TMS320C3x. 
For these examples, the implied most significant nonsign bit is made explicit. 

Example 4-1. Floating-Point Multiply (Both Mantissas = -2.0) 
Let: 

a = -2.0 x 2«(exP) = 10 .00000000000000000000000 x 2«( ex P) 
b = -2.0 x 2 b (* x P) = 10 .00000000000000000000000 x 2*>(exp) 

where: 

a and b are both represented in binary form according to the normalized sing- 
le-precision floating-point format. 

Then: 

10.00000000000000000000000 x 2<*(exp) 
x 10.00000000000000000000000 x 2*>(exp) 

0100.0000000000000000000000000000000000000000000000 x 2 M* X P) + b (©xp)) 



To place this number in the proper normalized format, it is necessary to shift 
the mantissa two places to the right and add 2 to the exponent. This yields: 

10.00000000000000000000000 x 2<*( ex P) 
x 10.00000000000000000000000 x 2 b ( ex P) 

01 .0000000000000000000000000000000000000000000000 x 2(a( ex P) + b(exp) + 2) 

In floating-point multiplication, the exponent of the result may overflow. This 
can occur when the exponents are initially added or when the exponent is mo- 
dified during normalization. 

Example 4-2. Floating-Point Multiply (Both Mantissas = 1.5) 
Let: 

a = 1.5x2«(exp) = 01.10000000000000000000000 x 2«(© X P) 

b = 1.5x2 b ( ex P) = 01.10000000000000000000000 x 2 b (© x P) 

where a and b are both represented in binary form according to the single-pre- 
cision floating-point format. Then: 

01 .10000000000000000000000 x 2«(exp) 
x 01 .10000000000000000000000 x 2 b fcxp) 

0010.0100000000000000000000000000000000000000000000 x 2 («( ex P) + b (exp)) 
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To place this number in the proper normalized format, it is necessary to shift 
the mantissa one place to the right and add 1 to the exponent. This yields: 

01.10000000000000000000000 x 2«( ex P) 
x 01 .10000000000000000000000 x 2 b (* x P) 

01 .00100000000000000000000000000000000000000000000 x 2 M ex P) + b(exp) + 1) 

Example 4-3. Floating-Point Multiply (Both Mantissas =1.0) 
Let: 

a = 1 .0 x 2<*(e x P) = 01 .00000000000000000000000 x 2<*( ex P) 
b = 1 .0 x 2b(exp) = Q1 .00000000000000000000000 x 2 b ( ex P) 

where a and b are both represented in binary form according to the single-pre- 
cision floating-point format. Then: 

01 .00000000000000000000000 x 2«( QX P) 
x 01 .00000000000000000000000 x 2 b ( ex P) 

0001 .0000000000000000000000000000000000000000000000 x 2 («(* X P) + b (© x P)) 

This number is in the proper normalized format. Therefore, no shift of the man- 
tissa or modification of the exponent is necessary. 

These examples have shown cases where the product of two normalized num- 
bers can be normalized with a shift of 0, 1 , or 2. For all normalized inputs with 
the floating-point format used by the TMS320C3x, a normalized result can be 
produced by a shift of 0, 1 , or 2. 

Example 4-4. Floating-Point Multiply Between Positive and Negative Numbers 
Let: 

a = 1 .0 x 2«(© X P) = 01 .00000000000000000000000 x 2«( ex P) 
b = -2.0 x 2 b (exp) = 1 0 .00000000000000000000000 x 2 b ( ex P) 

Then: 

01 .00000000000000000000000 x 2 a ( ex P) 
x 1 0 .00000000000000000000000 x 2 b (® x P) 

1110.0000000000000000000000000000000000000000000000 x 2 (<*(© X P) + b (* x P)) 
The result is c = -2.0 x 2(«(© X P) + b ( ex P)) 

Example 4-5. Floating-Point Multiply by 0 

All multiplications by a floating-point 0 yield a result of 0 (f = 0, s = 0, and exp 
= -128). 
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4.5 Floating-Point Addition and Subtraction 

In floating-point addition and subtraction, two floating-point numbers a and b 
can be defined as: 

a = a(man) x 2 <*( ex P) 
b = b(man) x 2 b(exp) 

The sum (or difference) of a and b can be defined as: 
c = a ± b 

= (a(man) ± (b(man) x 2 -(a(exp)-b(exp)))) x 2 a(exp), 

if a(exp) as b(exp) 
= ((a(man) x 2 -(b(exp)-a(exp))) ± b(man)) x 2 *>(exp) f 

if a(exp) < b(exp) 

The flowchart for floating-point addition is shown in Figure 4-14. Since this 
flowchart assumes signed data, it is also appropriate for floating-point subtrac- 
tion. In this figure, it is assumed that a(exp) * b(exp). In step 1 , the source ex- 
ponents are compared, and c(exp) is set equal to the largest of the two source 
exponents. In step 2, d is set to the difference of the two exponents. In step 3, 
the mantissa with the smallest exponent, in this case a (man), is right-shifted 
d bits to align the mantissas. After the mantissas have been aligned, they are 
added (step 4). 

Steps 5 through 7 check for a special case of c(man). If c(man) is 0 (step 5), 
then c(exp) is set to its most negative value (step 8) to yield the correct repre- 
sentation of 0. If c(man) has overflowed c (step 6), then c(man) is right-shifted 
one bit, and 1 is added to c(exp). Otherwise, step 1 0 normalizes c by left-shift- 
ing c(man) and subtracting c(exp) by the number of leading non-significant 
sign bits (step 7). Steps 11 through 13 check for special cases of c(exp). If 
c(exp) has overflowed (step 11) in the positive direction, then step 14 sets 
c(exp) to the most positive extended-precision format value. If c(exp) has over- 
flowed (step 1 1) in the negative direction, then step 1 4 sets c(exp) to the most 
negative extended-precision format value. If c(exp) has underf lowed (step 12), 
then step 15 sets c to 0; that is, c(man) = 0 and c(exp) = -128. 
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Figure 4-14. Flowchart for Floating-Point Addition 
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Example 4-6, Example 4-7, Example 4-8, and Example 4-9 describe the 
floating-point addition and subtraction operations. It is assumed that the data 
is in the extended-precision floating-point format. 

Example 4-6. Floating-Point Addition 

In the case of two normalized numbers to be summed, let 

a = 1 .5 = 01 .1 000000000000000000000000000000 x 2° 
b = 0.5 = 01 .0000000000000000000000000000000 x 2~ 1 

It is necessary to shift b to the right by 1 so that a and b have the same expo- 
nent. This yields: 

b = 0.5 = 00.1000000000000000000000000000000 x 20 
Then: 

01.10000000000000000000000000000000 x 20 
+ 00.10000000000000000000000000000000 x 20 

010.00000000000000000000000000000000 x 20 

As in the case of multiplication, it is necessary to shift the binary point one place 
to the left and add 1 to the exponent. This yields: 

01.1000000000000000000000000000000 x 20 
±00.1000000000000000000000000000000 x 20 

01 .0000000000000000000000000000000 x 2 1 
Example 4-7. Floating-Point Subtraction 

A subtraction is performed in this example. Let 

a = 01.0000000000000000000000000000001 x 2° 
b = 01 .0000000000000000000000000000000 x 20 

The operation to be performed is a-b. The mantissas are already aligned be- 
cause the two numbers have the same exponent. The result is a large cancel- 
lation of the upper bits, as shown below. 

01 .0000000000000000000000000000001 x 2° 
-01.0000000000000000000000000000000 x 20 

00.0000000000000000000000000000001 x 20 
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The result must be normalized. In this case, a left-shift of 31 is required. The 
exponent of the result is modified accordingly. The result is: 

01 .0000000000000000000000000000001 x 2° 
-01.0000000000000000000000000000000 x 20 

01 .0000000000000000000000000000000 x 2-31 
Example 4-8. Floating-Point Addition With a 32-Bit Shift 

This example illustrates a situation where a full 32-bit shift is necessary to nor- 
malize the result. Let 

a = 01.1111111111111111111111111111111 x 2 127 
b = 10.0000000000000000000000000000000 x 2^27 

The operation to be performed is a + b. 

01.1111111111111111111111111111111 x 2 127 
+ 10.0000000000000000000000000000000 x 2 127 
11.1111111111111111111111111111111 x 2 127 

Normalizing the result requires a left-shift of 32 and a subtraction of 32 from 
the exponent. The result is: 

01.1111111111111111111111111111111 x 2 i27 
+ 10.0000000000000000000000000000000 x 2 i27 
10.0000000000000000000000000000000 x 295 

Example 4-9. Floating-Point Addition/Subtraction With Floating-Point 0 

When floating-point addition and subtraction are performed with a float- 
ing-point 0, the following identities are satisfied: 

a ± 0 = a (a * 0) 

0±0 = 0 

0 -a = -a (a * 0) 
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4.6 Normalization Using the NORM Instruction 

The NORM instruction normalizes an extended-precision floating-point num- 
ber that is assumed to be unnormalized. See Example 4-10. Since the num- 
ber is assumed to be unnormalized, no implied most significant nonsign bit is 
assumed. The NORM instruction: 

1 ) Locates the most significant nonsign bit of the floating-point number, 

2) Left-shifts to normalize the number, and 

3) Adjusts the exponent. 

Example 4-10. NORM Instruction 

Assume that an extended-precision register contains the value 

man = 00000000000000000001000000000001 , exp = 0 

When the normalization is performed on a number assumed to be unnormal- 
ized, the binary point is assumed to be: 

man = 0.0000000000000000001000000000001, exp = 0 

This number is then sign-extended one bit so that the mantissa contains 33 
bits. 

man = 00.0000000000000000001000000000001, exp = 0 

The intermediate result after the most significant nonsign bit is located and the 
shift performed is: 

man = 01 .0000000000010000000000000000000, exp = -19 

The final 32-bit value output after removing the redundant bit is: 

man = 00000000000010000000000000000000, exp = -19 

The NORM instruction is useful for counting the number of leading 0s or lead- 
ing 1 s in a 32-bit field. If the exponent is initially 0, the absolute value of the final 
value of the exponent is the number of leading 1 s or 0s. This instruction is also 
useful for manipulating unnormalized floating-point numbers. 

Given the extended-precision floating-point value a to be normalized, the nor- 
malization, norm ( ), is performed as shown in Figure 4-15. 
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Figure 4-15. Flowchart for NORM Instruction Operation 
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4.7 Rounding: The RND Instruction 

The RND instruction rounds a number from the extended-precision float- 
ing-point format to the single-precision floating-point format. Rounding is simi- 
lar to floating-point addition. Given the number a to be rounded, the following 
operation is performed first. 

c * a(man) x 2<*(exP) + (1 x 2<*(exp)-24) 

Next, a conversion from extended-precision floating-point to single-precision 
floating-point format is performed. Given the extended-precision floating-point 
value, the rounding, rnd( ), is performed as shown in Figure 4-16. 
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Figure 4-16. Flowchart for Floating-Point Rounding by the RND Instruction 
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4.8 Floating-Point-to-lnteger Conversion 

Floating-point to integer conversion, using the FIX instructions, allows exten- 
ded-precision floating-point numbers to be converted to single-precision inte- 
gers in a single cycle. The floating-point to integer conversion of the value x 
is referred to here as fix(x). The conversion does not overflow if a, the number 
to be converted, is in the range 

-231 ^ a ^ 231 - 1 
First, you must be certain that 
a(exp) ss 30 

If these bounds are not met, an overflow occurs. If an overflow occurs in the 
positive direction, the output is the most positive integer. If an overflow occurs 
in the negative direction, the output is the most negative integer. If a(exp) is 
within the valid range, then a(man), with implied bit included, is sign-extended 
and right-shifted (rs) by the amount 

rs = 31 - a(exp) 

This right-shift (rs) shifts out those bits corresponding to the fractional part of 
the mantissa. For example: 

If 0 ss x < 1 , then fix(x) = 0. 
If -1 ss x < 0, then fix(x) = -1 . 

The flowchart for the floating-point-to-integer conversion is shown in 
Figure 4-17. 
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Figure 4-17. Flowchart for Floating-Point-to-lnteger Conversion by FIX Instructions 
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4.9 Integer-to-Floating-Point Conversion 

Integer to floating-point conversion, using the FLOAT instruction, allows sing- 
le-precision integers to be converted to extended-precision floating-point 
numbers. The flowchart for this conversion is shown in Figure 4-1 8. 

Figure 4-18. Flowchart for Integer-to-Floating-Point Conversion by FLOAT Instructions 
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The TMS320C3x supports five groups of powerful addressing modes. Six 
types of addressing may be used within the groups, which allow access of data 
from memory, registers, and the instruction word. This chapter details the op- 
eration, encoding, and implementation of the addressing modes. It also dis- 
cusses the management of system stacks, queues, and dequeues in memory. 



These are the major topics in this chapter: 

Topic Page 

5.1 types of Addressing 5-2 

5.2 Groups of Addressing Modes 5-19 

5.3 Circular Addressing 5-24 

5.4 Bit-Reversed Addressing 5-29 

5.5 System and User Stack Management .......................... 5-31 
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5.1 Types of Addressing 

Six types of addressing allow access of data from memory, registers, and the 
instruction word: 

□ Register 

□ Direct 

□ Indirect 

□ Short-immediate 

□ Long-immediate 

□ PC-relative 

Some types of addressing are appropriate for some instructions but not others. 
For this reason, the types of addressing are used in the five groups of address- 
ing modes as follows: 

□ General addressing modes (G): 

■ Register 

■ Direct 

■ Indirect 

■ Short-immediate 

□ Three-operand addressing modes (T): 

■ Register 

■ Indirect 

□ Parallel addressing modes (P): 

■ Register 

■ Indirect 

□ Conditional-branch addressing modes (B): 

■ Register 

■ PC-relative 

The six types of addressing are discussed first, followed by the five groups of 
addressing modes. 
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5.1 .1 Register Addressing 

In register addressing, a CPU register contains the operand, as shown in this 
example: 

ABSF Rl ; Rl » |R1| 

The syntax for the CPU registers, the assembler syntax, and the assigned 
function for those registers are listed in Table 5-1 . 

Table 5-1. CPU Register Address/Assembler Syntax and Function 





Assembler 


Assigned 


CPU Register Address 


Syntax 


Function 


oon 


QA 

HO 


Extended-precision register 


oin 


Ml 


txienaea-precision register 


AAL 

02 n 


DO 


Extended-precision register 


03h 


R3 


Extended-precision register 


04h 


R4 


Extended-precision register 


05h 


R5 


Extended-precision register 


06h 


R6 


Extended-precision register 


07h 


R7 


Extended-precision register 


08h 


AR0 


Auxiliary register 


09h 


AR1 


Auxiliary register 


OAh 


AR2 


Auxiliary register 


OBh 


AR3 


Auxiliary register 


OCh 


AR4 


Auxiliary register 


ODh 


AR5 


Auxiliary register 


OEh 


AR6 


Auxiliary register 


OFH 


AR7 


Auxiliary register 


10h 


DP 


Data-page pointer 


11h 


IR0 


Index register 0 


12h 


IR1 


Index register 1 


13h 


BK 


Block-size register 


14h 


SP 


Active stack pointer 


15h 


ST 


Status register 


16h 


IE 


CPU/DMA interrupt enable 


17h 


IF 


CPU interrupt flags 


18h 


IOF 


I/O flags 


19h 


RS 


Repeat start address 


1Ah 


RE 


Repeat end address 


1Bh 


RC 


Repeat counter 
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5.1 .2 Direct Addressing 



In direct addressing, the data address is formed by the concatenation of the 
eight least significant bits of the data page pointer (DP) with the 1 6 least signifi- 
cant bits of the instruction word (expr). This results in 256 pages (64K words per 
page), giving the programmer a large address space without requiring a change 
of the page pointer. The syntax and operation for direct addressing are: 

Syntax: @expr 

Operation: address = DP concatenated with expr 

Figure 5-1 shows the formation of the data address. Example 5-1 is an 
instruction example with data before and after instruction execution. 



Figure 5-1. Direct Addressing 
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Example 5-1 .Direct Addressing 



8 7 



X ... X 



page 



ADD I §0BCDEh,R7 

Before Instruction: 

DP = 8Ah 
R7 = 0h 

Data at 8ABCDEh = 1 2345678h 



expr 



O 1 G1 

0 0...0 0 


address 


31 


o 


operand 



After Instruction: 

DP = 8Ah 

R7 = 12345678h 

Data at 8ABCDEh = 12345678h 
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5.1 .3 Indirect Addressing 

Indirect addressing is used to specify the address of an operand in memory 
through the contents of an auxiliary register, optional displacements, and in- 
dex registers. Only the 24 least significant bits of the auxiliary registers and in- 
dex registers are used in indirect addressing. This arithmetic is performed by 
the auxiliary register arithmetic units (ARAUs) on these lower 24 bits and is un- 
signed. The upper eight bits are unmodified. 

The flexibility of indirect addressing is possible because the ARAUs on the 
TMS320C3x modify auxiliary registers in parallel with operations within the 
main CPU. Indirect addressing is specified by a five-bit field in the instruction 
word, referred to as the mod field. A displacement is either an explicit unsigned 
eight-bit integer contained in the instruction word or an implicit displacement 
of one. Two index registers, IRO and IR1 , can also be used in indirect address- 
ing. In some cases, an optional addressing scheme using circular or bit-rev- 
ersed addressing can be used. The mechanism for generating addresses in 
circular addressing is discussed in Section 5.3 on page 5-24; bit-reversed is 
discussed in Section 5.4 on page 5-29. 

i i 
Note: Auxiliary Register 

The auxiliary register (ARn) to be used is encoded in the instruction word ac- 
cording to its binary representation n (for example, AR3 is encoded as 11 2), 
not its register machine address (shown in Table 5-1). 

L. ' 

Example 5-2. Auxiliary Register Indirect 

An auxiliary register (ARn) contains the address of the operand to be fetched. 
Operation: operand address = ARn 

Assembler Syntax: *ARn 
Modification Field: 11000 



31 



ARn 



24 23 



X X 


address 


31 1 


r 0 


operand 



Table 5-2 lists the various kinds of indirect addressing, along with the value 
of the modification (mod) field, assembler syntax, operation, and function for 
each. The succeeding 17 examples show the operation for each kind of indi- 
rect addressing. Figure 5-2 shows the format in the instruction encoding. 
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Table 5-2. Indirect Addressing 



Mod Field Syntax 



Operation 



Description 



00000 
00001 
00010 

00011 

00100 

00101 

00110 

00111 



Indirect Addressing with Displacement 



*+ARn(disp) 
*-ARn(disp) 
*++ARn(disp) 

* — ARn(disp) 

*ARn++(disp) 

*ARn — (disp) 

*ARn++(disp)% 



addr = ARn + disp 

addr = ARn - disp 

addr = ARn + disp 
ARn = ARn + disp 

addr = ARn - disp 
ARn = ARn - disp 

addr = ARn 
ARn = ARn + disp 

addr = ARn 
ARn = ARn - disp 



addr = ARn 

ARn = circ(ARn + disp) 

*ARn — (disp)% addr = ARn 

ARn = circ(ARn - disp) 



With predispiacement add 

With predispiacement subtract 

With predispiacement add and modify 

With predispiacement subtract and modify 

With postdisplacement add and modify 

With postdisplacement subtract and modify 

With postdisplacement add and circular modify 

With postdisplacement subtract and circular 
modify 



Indirect Addressing with Index Register IRO 



01000 


*+ARn(IR0) 


addr 


= ARn + IRO 


With preindex (IRO) add 


01001 


*-ARn(IR0) 


addr 


= ARn -IRO 


With preindex (IRO) subtract 


01010 


*++ARn(IR0) 


addr 
ARn 


= ARn + IRO 
= ARn + IRO 


With preindex (IRO) add and modify 


01011 


* — ARn(IRO) 


addr 
ARn 


= ARn - IRO 
= ARn - IRO 


With preindex (IRO) subtract and modify 


01100 


*ARn++(IR0) 


addr 
ARn 


= ARn 

= ARn + IRO 


With postindex (IRO) add and modify 


01101 


*ARn-- -(IRO) 


addr: 
ARn 


= ARn 

= ARn - IRO 


With postindex (IRO) subtract and modify 


01110 


*ARn++(IR0)% 


addr 
ARn 


= ARn 

= circ(ARn + IRO) 


With postindex (IRO) add and circular 
modify 


01111 


*ARn— (IR0)% 


addr 
ARn 


= ARn 

= circ(ARn)-IRO 


With postindex (IRO) subtract and circular 
modify 



Legend: addr memory address 

ARn auxiliary register ARO-AR7 

circ() address in circular addressing 

disp displacement 



++ add and modify 

subtract and modify 
% where circular addressing is performed 



5-6 



Types of Addressing 



Table 5-2. Indirect Addressing (Continued) 



Mod Field Syntax 



Operation 



Description 



Indirect Addressing with Index Register IR1 



10000 


*+ARn(IR1) 


addr = ARn + IR1 


With preindex (IR1) add 


10001 


*-ARn(IR1) 


addr = ARn - IR1 


With preindex (IR1) subtract 


10010 


*++ARn(IR1) 


addr = ARn + IR1 
ARn = ARn + IR1 


With preindex (IR1) add 
and modify 


10011 


* — ARn(IR1) 


addr = ARn - IR1 
ARn = ARn - IR1 


With preindex (IR1) subtract 
and modify 


I U I uu 


Mrin ++ yir\ i) 


addr = ARn 
ARn = ARn + IR1 


With postindex (IR1) add 
and modify 


10101 


*ARn — (IR1) 


addr = ARn 
ARn = ARn - IR1 


With postindex (IR1) subtract 
and modify 


10110 


*ARn++(IR1)% 


addr = ARn 

ARn = circ(ARn + IR1) 


With postindex (IR1) add 
and circular modify 


10111 


*ARn--(IR1)% 


addr = ARn 

ARn = circ(ARn-IR1) 


With postindex (IR1) subtract 
and circular modify 






Indirect Addressing (Special Cases) 


11000 


*ARn 


addr = ARn 


Indirect 


11001 


*ARn++(IR0)B 


addr = ARn 

ARn = B(ARn + IRO) 


With postindex (IRO) add 
and bit-reversed modify 



Legend: 



addr 
ARn 
B 



memory address circQ 
auxiliary register AR0-AR7 ++ 
where bit-reversed addressing is performed % 



address in circular addressing 
add and modify 

where circular addressing is performed 



Example 5-3, Example 5-4, Example 5-5, Example 5-6, Example 5-7, 
Example 5-8, Example 5-9, Example 5-10, Example 5-11, Example 5-12, 
Example 5-13, Example 5-14, Example 5-15, Example 5-16, 
Example 5-1 7, Example 5-1 8, and Example 5-1 9 exemplify indirect addres- 
sing in Table 5-2. 



Figure 5-2. Instruction Encoding Format 

Most Significant Bit 



MOD 



ARn 



dispt 



Least Significant Bit 



5 Bits 3 Bits 0, 5, or 8 Bits 
t disp field may not exist in some instructions 
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Example 5-3. Indirect With Predisplacement Add 



The address of the operand to be fetched is the sum of an auxiliary register 
(ARn) and the displacement (disp). The displacement is either an eight-bit un- 
signed integer contained in the instruction word or an implied value of 1 . 

Operation: operand address = ARn + disp 

Assembler Syntax: *+ ARn (disp) 

Modification Field: 00000 



31 



ARn- 



31 



disp 



0...0 



31 



24 23 



address 



8 7 



integer 



W 



operand 



Example 5-4. Indirect With Predisplacement Subtract 



The address of the operand to be fetched is the contents of an auxiliary register 
(ARn) minus the displacement (disp). The displacement is either an eight-bit 
unsigned integer contained in the instruction word or an implied value of 1 . 

Operation: operand address = ARn - disp 

Assembler Syntax: *- ARn (disp) 

Modification Field: 00001 



31 



ARn 



24 23 



address 



31 



disp 



0...0 



31 



8 7 



integer 



operand 



Is. 
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Example 5-5. Indirect With Predisplacement Add and Modify 



The address of the operand to be fetched is the sum of an auxiliary register 
(ARn) and the displacement (disp). The displacement is either an eight-bit 
unsigned integer contained in the instruction word or an implied value of 1 . 
After the data is fetched, the auxiliary register is updated with the address gen- 
erated. 

Operation: operand address = ARn + disp 

ARn = ARn + disp 
Assembler Syntax: *++ ARn (disp) 

Modification Field: 0001 0 



31 



ARn- 



24 23 



address 



31 



disp 



0...0 



31 



8 7 



integer 



(+) 



operand 



Example 5-6. Indirect With Predisplacement Subtract and Modify 

The address of the operand to be fetched is the contents of an auxiliary register 
(ARn) minus the displacement (disp). The displacement is either an eight-bit 
unsigned integer contained in the instruction word or an implied value of 1 . Af- 
ter the data is fetched, the auxiliary register is updated with the address gener- 
ated. 

Operation: operand address = ARn - disp 

ARn = ARn - disp 

Assembler Syntax: * — ARn(disp) 
Modification Field: 00011 







31 


24 23 










0 




ARn — * 


X 


X 


address 


31 








8 


7 0 






1 


r" 


0 


0...0 






0 


integer 


— ► (- 


-) 










31 








i 


r 




0 






operand 
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Example 5-7. Indirect With Postdisplacement Add and Modify 

The address of the operand to be fetched is the contents of an auxiliary register 
(ARn). After the operand is fetched, the displacement (disp) is added to the 
auxiliary register. The displacement is either an eight-bit unsigned integer con- 
tained in the instruction word or an implied value of 1 . 

Operation: operand address = ARn 

ARn = ARn + disp 

Assembler Syntax: *ARn ++ (disp) 

Modification Field: 00100 



31 



24 23 



ARn 



address 



31 



8 7 



disp 0 



0...0 



0 integer 



(+)■ 



31 



operand 



Example 5-8. Indirect With Postdisplacement Subtract and Modify 

The address of the operand to be fetched is the contents of an auxiliary register 
(ARn). After the operand is fetched, the displacement (disp) is subtracted from 
the auxiliary register. The displacement is either an eight-bit unsigned integer 
contained in the instruction word or an implied value of 1 . 

Operation: operand address = ARn 

ARn = ARn - disp 

Assembler Syntax: *ARn — (disp) 

Modification Field: 00101 



31 



24 23 



ARn 



address 



31 



8 7 



disp 



0...0 



integer 



(-) 



31 



operand 
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Example 5-9. Indirect With Postdisplacement Add and Circular Modify 

The address of the operand to be fetched is the contents of an auxiliary register 
(ARn). After the operand is fetched, the displacement (disp) is added to the 
contents of the auxiliary register using circular addressing. This result is used 
to update the auxiliary register. The displacement is either an eight-bit un- 
signed integer contained in the instruction word or an implied value of 1 . 

Operation: operand address = ARn 

ARn = circ(ARn+disp) 
Assembler Syntax: *ARn ++ (disp)% 

Modification Field: 00110 







31 


24 23 








0 




ARn » 


X 


X 


address 


31 








8 


7 0 


T 

(%) 






0 


0...0 






0 


integer 


i 

— ► (+) 




> 






31 










1 








operand 



Example 5-10. Indirect With Postdisplacement Subtract and Circular Modify 

The address of the operand to be fetched is the contents of an auxiliary register 
(ARn) . After the operand is fetched, the displacement (disp) is subtracted from 
the contents of the auxiliary register using circular addressing. This result is 
used to update the auxiliary register. The displacement is either an eight-bit 
unsigned integer contained in the instruction word or an implied value of 1 . 

Operation: operand address = ARn 

ARn = circ(ARn - disp) 
Assembler Syntax: *ARn — (disp)% 

Modification Field: 00111 







31 


24 


23 










0 




ARn » 


X 


X 


address 


31 








8 


7 0 




t 

(%) 






0 


0...0 






0 


integer 




H 




> 






31 














r 0 






operand 
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Example 5-11. Indirect With Preindex Add 



The address of the operand to be fetched is the sum of an auxiliary register 
(ARn) and an index register (IRO or IR1). 

Operation: operand address = ARn + IRm 

Assembler Syntax: *+ ARn (IRm) 



Modification Field: 



01000 
10000 



if m = 0 
if m = 1 



31 



ARn 



31 



24 23 



IRm- 



31 



24 23 



index 



address 



operand 



-> (+) 



Example 5-12. Indirect With Preindex Subtract 



The address of the operand to be fetched is the difference of an auxiliary regis- 
ter (ARn) and an index register (IRO or IR1). 

Operation: operand address = ARn - IRm 

Assembler Syntax: *- ARn(IRm) 

Modification Field: 01001 if m = 0 

10001 ifm=1 



31 



ARn 



24 23 



address 



31 24 23 



IRm — t 



31 



index 



operand 
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Example 5-13. Indirect With Preindex Add and Modify 

The address of the operand to be fetched is the sum of an auxiliary register 
(ARn) and an index register (IRO or IR1). After the data is fetched, the auxiliary 
register is updated with the address generated. 

Operation: operand address = ARn + IRm 

ARn = ARn + IRm 

Assembler Syntax: *++ ARn (IRm) 

Modification Field: 01010 if m = 0 

10010 ifm=1 



31 



24 23 



ARn- 



address 



31 



24 23 



IRm- 



index 



31 



operand 



Example 5-14. Indirect With Preindex Subtract and Modify 

The address of the operand to be fetched is the difference between an auxiliary 
register (ARn) and an index register (IRO or I R1). The resulting address be- 
comes the new contents of the auxiliary register. 

Operation: operand address = ARn - IRm 

ARn = ARn - IRm 

Assembler Syntax: * — ARn(IRm) 

Modification Field: 01011 if m = 0 

10011 ifm = 1 



31 



IRm 



31 



ARn- 



24 23 



31 



24 23 



address 



index 



operand 
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Example 5-15. Indirect With Postindex Add and Modify 

The address of the operand to be fetched is the contents of an auxiliary register 
(ARn). After the operand is fetched, the index register (IRO or IR1) is added 
to the auxiliary register. 

Operation: operand address = ARn 

ARn = ARn + IRm 

Assembler Syntax: *ARn ++ (IRm) 

Modification Field: 01 1 00 if m = 0 

10100 ifm = 1 



31 



ARn 



24 23 



address 



31 



24 23 



IRm 



index 



31 



operand 



Example 5-16. Indirect With Postindex Subtract and Modify 

The address of the operand to be fetched is the contents of an auxiliary register 
(ARn). After the operand is fetched, the index register (IRO or IR1) is sub- 
tracted from the auxiliary register. 

Operation: operand address = ARn 

ARn = ARn - IRm 

Assembler Syntax: *ARn — (IRm) 

Modification Field: 01101 if m = 0 

10101 ifm = 1 



31 



24 23 



ARn- 



address 



31 



24 23 



IRm- 



index 



31 



operand 
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Example 5-17. Indirect With Postindex Add and Circular Modify 

The address of the operand to be fetched is the contents of an auxiliary register 
(ARn). After the operand is fetched, the index register (IRO or IR1) is added 
to the auxiliary register. This value is evaluated using circular addressing and 
replaces the contents of the auxiliary register. 

Operation: operand address = ARn 

ARn = circ(ARn + IRm) 

Assembler Syntax: *ARn ++ (IRm)% 

Modification Field: 01110 if m = 0 

10110 ifm = 1 



31 



ARn- 



31 



24 23 



IRm 



31 



24 23 



address 



index 



T 

(%) 

w 



operand 



Example 5-18. Indirect With Postindex Subtract and Circular Modify 

The address of the operand to be fetched is the contents of an auxiliary register 
(ARn). After the operand is fetched, the index register (IRO or IR1) is sub- 
tracted from the auxiliary register. This result is evaluated using circular ad- 
dressing and replaces the contents of the auxiliary register. 

Operation: operand address = ARn 

ARn = circ(ARn - IRm) 

Assembler Syntax: *ARn — (IRm)% 

Modification Field: 01111 if m = 0 

10111 ifm = 1 



31 



ARn 



31 



24 23 



IRm 



31 



24 23 



address 



index 



T 

(%) 
(-) 



operand 
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Example 5-19. Indirect With Postindex Add and Bit-Reversed Modify 

The address of the operand to be fetched is the contents of an auxiliary register 
(ARn). After the operand is fetched, the index register (IRO) is added to the 
auxiliary register. This addition is performed with a reverse-carry propagation 
and can be used to yield a bit-reversed (B) address. This value replaces the 
contents of the auxiliary register. 

Operation: operand address = ARn 

ARn = B(ARn + IRO) 
Assembler Syntax: *ARn ++ (IRO)B 

Modification Field: 11001 



31 

ARn » x 



31 



24 23 



IRm- 



31 



24 23 



address 



index 



T" 

(B) 
(+) 



operand 



5.1 .4 Short-Immediate Addressing 

In short-immediate addressing, the operand is a 16-bit immediate value con- 
tained in the 1 6 least significant bits of the instruction word (expr). Depending 
on the data types assumed for the instruction, the short-immediate operand 
can be a two's complement integer, an unsigned integer, or a floating-point 
number. This is the syntax for this mode: 

Syntax: expr 
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Example 5-20 illustrates before- and after-instruction data. 
Example 5-20. Short-Immediate Addressing 

SUBI 1,R0 

Before Instruction: After Instruction: 

R0 = Oh R0 = OFFFFFFFFh 



5.1 .5 Long-Immediate Addressing 

In long-immediate addressing, the operand is a 24-bit immediate value con- 
tained in the 24 least significant bits of the instruction word (expr). This is the 
syntax for this mode: 

Syntax: expr 

Example 5-21 illustrates before- and after-instruction data. 
Example 5-21. Long-Immediate Addressing 

BR 8000h 

Before Instruction: After Instruction: 

PC = Oh PC = 8000h 

5.1 .6 PC-Relative Addressing 

Program counter (PC)-relative addressing is used for branching. It adds the 
contents of the 1 6 or 24 least significant bits of the instruction word to the PC 
register. The assembler takes the src (a label or address) specified by the user 
and generates a displacement. If the branch is a standard branch, this dis- 
placement is equal to [label - (instruction address +1)]. If the branch is a 
delayed branch, this displacement is equal to [label - (instruction ad- 
dress +3)]. 

The displacement is stored as a 1 6-bit or 24-bit signed integer in the least sig- 
nificant bits of the instruction word. The displacement is added to the PC during 
the pipeline decode phase. Notice that because the PC is incremented by 1 
in the fetch phase, the displacement is added to this incremented PC value. 

Syntax: expr (src) 

Example 5-22 illustrates before- and after-instruction data. 
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Example 5-22. PC-Relative Addressing 

BU NEWPC ? pc=1001h, NEWPC label - 1005h, displacement « 3 

Before Instruction After Instruction 

decode phase: execution phase: 

PC = 1002h PC = 1005h 

The 24-bit addressing mode encodes the program control instructions (for ex- 
ample, BR, BRD, CALL, RPTB, and RPTBD). Depending on the instruction, 
the new PC value is derived by adding a 24-bit signed value in the instruction 
word with the present PC value. Bit 24 determines the type of branch (D = 0 
for a standard branch or D = 1 for a delayed branch). Some of the instructions 
are encoded in Figure 5-3. 

Figure 5-3. Encoding for 24-Bit PC-Relative Addressing Mode 





(a) BR, BRD: unconditional branches (standard and delayed) 




31 


25 24 23 


0 


|o 1 


1 0 0 0 0 1 0 | displacement 


I 




(b) CALL: unconditional subroutine call 




31 


24 23 


0 


|o 1 


1 0 0 0 1 1 0 | displacement 


I 




(c) RPTB: repeat block 




31 


25 24 23 


0 


1. 1 


1 0 0 1 0 1 0 | displacement 


I 
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5.2 Groups of Addressing Modes 

Six types of addressing (covered in Section 5.1 , beginning on page 5-2) form 
these four groups of addressing modes: 

□ General addressing modes (G) 

□ Three-operand addressing modes (T) 

□ Parallel addressing modes (P) 

□ Conditional-branch addressing modes (B) 

5.2.1 General Addressing Modes 

Instructions that use the general addressing modes are general-purpose in- 
structions, such as ADDI, MPYF, and LSH. Such instructions usually have this 
form: 

dst operation src dst 

where the destination operand is signified by dst and the source operand by 
src, operation defines an operation to be performed on the operands using the 
general addressing modes. Bits 31 -29 are 0, indicating general addressing 
mode instructions. Bits 22 and 21 specify the general addressing mode (G) 
field, which defines how bits 1 5-0 are to be interpreted for addressing the src 
operand. 

Options for bits 22 and 21 (G field) are as follows: 

0 0 register (all CPU registers unless specified otherwise) 

0 1 direct 

1 0 indirect 
1 1 immediate 

If the src and dst fields contain register specifications, the value in these fields 
contains the CPU register addresses as defined by Table 5-1 on page 5-3. 
For the general addressing modes, the following values of ARn are valid: 

ARn, 0 as n ss 7 

Figure 5-4 shows the encoding for the general addressing modes. The nota- 
tion mod indicates the modification field that goes with the ARn field. Refer to 
Table 5-2 on page 5-6 for further information. 
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Figure 5-4. Encoding for General Addressing Modes 

31 29 28 23 22 21 20 1615 11 10 8 7 5 4 0 



0 0 0 


operation 


0 0 


dst 


00000000000 | src 


0 0 0 


operation 


0 1 


dst 


direct 


0 0 0 


operation 


1 0 


dst 


modn ARn disp 


0 0 0 


operation 


1 1 


dst 


immediate 



Q Destination Source Operands 



5.2.2 Three-Operand Addressing Modes 

Instructions that use the three-operand addressing modes, such as 
ADDI3, LSH3, CMPF3. or XOR3, usually have this form: 

SRC1 operation SRC2-* dst 

where the destination operand is signified by dst and the source operands by 
SRC1 and SRC2; operation defines an operation to be performed. Note that 
the 3 can be omitted from three-operand instructions. 

Bits 31-29 are set to the value of 001 , indicating three-operand addressing 
mode instructions. Bits 22 and 21 specify the three-operand addressing mode 
(T) field, which defines how bits 15-0 are to be interpreted for addressing the 
SRC operands. Bits 1 5-8 define the SRC1 address; bits 7-0 define the SRC2 
address. Options for bits 22 and 21 (T) are as follows: 



T SRC1 SRC2 

0 0 register register 

0 1 indirect register 

1 0 register indirect 
1 1 indirect indirect 



Figure 5-5 shows the encoding for three-operand addressing. If the SRC1 
and SRC2 fields use the same auxiliary register, both addresses are correctly 
generated. However, only the value created by the SRC1 field is saved in the 
auxiliary register specified. The assembler issues a warning if you specify this 
condition. 

The following values of ARn and ARm are valid: 

ARn,0*nss 7 
ARm,0 * m * 7 
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The notation modm or modn indicates that the modification field goes with the 
ARm or ARn field, respectively. Refer to Table 5-2 on page 5-6 for further 
information. 

In indirect addressing of the three-operand addressing mode, displacements 
(if used) are allowed to be 0 or 1 , and the index registers (IRO and IR1) can be 
used. The displacement of 1 is implied and is not explicitly coded in the instruc- 
tion word. 

Figure 5-5. Encoding for Three-Operand Addressing Modes 



31 29 28 23 22 21 20 16 15 13 12 11 10 8 7 5 4 3 2 0 



0 0 1 


operation 


0 0 


dst 


0 0 0 | srrt 


0 0 0 


src2 


0 0 1 


operation 


0 1 


dst 


modn 1 ARn 


0 0 0 


src2 


0 0 1 


operation 


1 0 


dst 


0 0 0 1 srd 


modn 


ARn 


0 0 1 


operation 


1 1 


dst 


modn | ARn 


modm 


ARm 



| T | | SRC1 | SRC2 



5.2.3 Parallel Addressing Modes 

Instructions that use parallel addressing, indicated by || (two vertical bars), al- 
low the most parallelism possible. The destination operands are indicated as 
d1 and d2, signifying ofsfl and dst2, respectively (see Figure 5-6). The source 
operands, signified by srd and src2, use the extended-precision registers. 
Operation refers to the parallel operation to be performed. 

Figure 5-6. Encoding for Parallel Addressing Modes 

31 3029 26 25 2423 22 21 19 18 1615 1011 8 7 3 2 0 

| 1 0 | operation | P | d1 | d2 | srd \ src2 | modn | ARn | modm | ARm | 

src3 srcA 
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The parallel addressing mode (P) field specifies how the operands are to be 
used, that is, whether they are source or destination. The specific relationship 
between the P field and the operands is detailed in the description of the indi- 
vidual parallel instructions (see Chapter 10). However, the operands are al- 
ways encoded in the same way. Bits 31 and 30 are set to the value of 1 0, indi- 
cating parallel addressing mode instructions. Bits 25 and 24 specify the paral- 
lel addressing mode (P) field, which defines how bits 21 -0 are to be interpreted 
for addressing the src operands. Bits 21-19 define the srcl address, bits 
18-16 define the src2 address, bits 15-8 the src3 address, and bits 7-0 the 
src 4 address. The notations modn and modm indicate which modification field 
goes with which ARn or ARm (auxiliary register) field, respectively. Following 
is a list of the parallel addressing operands: 



□ 




0 * srd £ 7 (extended-precision registers R0-R7) 


□ 


src2 


0 s src2. s 7 (extended-precision registers R0-R7) 


□ 


d1 


IfO, dsn isRO. If 1 , cfe/1 isR1. 


□ 


d2 


IfO, dsCisR2. If 1,dsCisR3. 


a 


P 


Os Ps3 


□ 


src3 


indirect (disp = 0, 1 , IRO, IR1) 


□ 


srcA 


indirect (disp = 0, 1, IRO, IR1) 



As in the three-operand addressing mode, indirect addressing in the parallel 
addressing mode allows for displacements of 0 or 1 and the use of the index 
registers (IRO and IR1). The displacement of 1 is implied and is not explicitly 
coded in the instruction word. 

In the encoding shown for this mode in Figure 5-6 on page 5-21 , if the src3 
and srcA fields use the same auxiliary register, both addresses are correctly 
generated, but only the value created by the src3 field is saved in the auxiliary 
register specified. The assembler issues a warning if you specify this condi- 
tion. 
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5.2.4 Conditional-Branch Addressing Modes 

Instructions using the conditional-branch addressing modes (Bcond, BcondD, 
CALLcond, DBcond, and DBcondD) can perform a variety of conditional oper- 
ations. Bits 31-27 are set to the value of 01 1 01 , indicating conditional-branch 
addressing mode instructions. Bit 26 is set to 0 or 1 ; 0 selects DBcond, 1 se- 
lects Bcond. Selection of bit 25 determines the conditional-branch addressing 
mode (B). If B = 0, register addressing is used; if B = 1 , PC-relative addressing 
is used. Selection of bit 21 sets the type of branch: D = 0 for a standard branch 
or D = 1 for a delayed branch. The condition field (cond) specifies the condition 
checked to determine what action to take, that is, whether to branch (see 
Chapter 10 for a list of condition codes). Figure 5-7 shows the encoding for 
conditional-branch addressing. 

Figure 5-7. Encoding for Conditional-Branch Addressing Modes 
DBcond (D): 



31 


27 26 


25 


24 22 21 20 




16 


15 




5 4 




0 


0 110 


1 1 


B 


ARn 


D 


cond 


0 0 0 


0 


0 0 0 0 0 0 0 | 


src reg 




0 110 


1 1 


B 


ARn 


D 


cond 


immediate (PC relative) 


Bcond (D): 


























31 


27 26 


25 


24 22 21 20 




16 


15 




5 4 




0 


0 110 


1 0 


B 


0 0 0 


D 


cond 


0 0 0 


0 


0 0 0 0 0 0 0 | 


src reg 




0 110 


1 0 


B 


0 0 0 


D 


cond 


immediate (PC relative) 


CALLconof: 


























31 


27 26 


25 


24 22 21 20 




16 


15 




5 4 




0 


0 111 


0 0 


B 


0 0 0 


0 


cond 


0 0 0 


0 


0 0 0 0 0 0 0 | 


src reg 




0 111 


0 0 


B 


0 0 0 


0 


cond 


immediate (PC relative) 
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5.3 Circular Addressing 

Many algorithms, such as convolution and correlation, require the implemen- 
tation of a circular buffer in memory. In convolution and correlation, the circular 
buffer is used to implement a sliding window that contains the most recent data 
to be processed. As new data is brought in, the new data overwrites the oldest 
data. Key to the implementation of a circular buffer is the implementation of a 
circular addressing mode. This section describes the circular addressing 
mode of the TMS320C3x. 

The block size register (BK) specifies the size of the circular buffer. By labeling 
the most significant 1 of the BK register as bit N, with N ^ 1 5, you can find the 
address immediately following the bottom of the circular buffer by concatenat- 
ing bits 31 through N + 1 of a user-selected register (ARn) with bits N through 
0 of the BK register. The address of the top of the buffer is referred to as the 
effective base (EB) and can be found by concatenating bits 31 through N + 1 
of ARn, with bits N through 0 of EB being 0. 

Figure 5-8 illustrates the relationships between the block size register (BK), 
the auxiliary registers (ARn), the bottom of the circular buffer, the top of the cir- 
cular buffer, and the index into the circular buffer. 

A circular buffer of size R must start on a K-bit boundary (that is, the K LSBs 
of the starting address of the circular buffer must be 0), where K is an integer 
that satisfies 2 K > R. Since the value R must be loaded into the BK register, 
K > N + 1 . For example, a 31 -word circular buffer must start at an address 
whose five LSBs are 0 (that is, XXXXXXXXXXXXXXXXXXXXXXXXXXXOOOOO2), 
and the value 31 must be loaded into the BK register. 
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Figure 5-8. Flowchart for Circular Addressing 
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In circular addressing, index refers to the N LSBs of the auxiliary register se- 
lected, and step is the quantity being added to or subtracted from the auxiliary 
register. Follow these two rules when you use circular addressing: 

□ The step used must be less than or equal to the block size. The step size 
is treated as an unsigned integer. 

□ The first time the circular queue is addressed, the auxiliary register must 
be pointing to an element in the circular queue. 

The algorithm for circular addressing is as follows: 

If 0 * index + step < BK: 
index = index + step. 

Else if index + step * BK: 

index = index + step - BK. 

Else if index + step < 0: 

index = index + step + BK. 

Figure 5-9 shows how the circular buffer is implemented and illustrates the re- 
lationship of the quantities generated and the elements in the circular buffer. 

Figure 5-9. Circular Buffer Implementation 
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Example 5-23 shows circular addressing operation. Assuming that all ARs 
are four bits, let ARO = 0000, and BK = 01 1 0 (block size of 6). Example 5-23 
shows a sequence of modifications and the resulting value of ARO. 
Example 5-23 also shows how the pointer steps through the circular queue 
with a variety of step sizes (both incrementing and decrementing). 

Example 5-23. Circular Addressing 



*AR0++(5)% 
*AR0++(2)% 
*AR0-- (3)% 
*AR0++(6)% 
*AR0 — % 
*AR0 



ARO = 0 (0th value) 

ARO = 5 (1st value) 

ARO = 1 (2nd value) 

ARO = 4 (3rd value) 

ARO = 4 (4th value) 

ARO = 3 (5th value) 



Value 

0th 
2nd 



Data 



5th 
4th, 3rd 
1st 



Element 0 



Element 1 



Element 2 



Element 3 



Element 4 



Element 5 (Last Element) 



Last Element + 1 



Address 

0 
1 
2 
3 
4 
5 
6 
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Circular addressing is especially useful for the implementation of FIR filters. 
Figure 5-1 0 shows one possible data structure for Fl R filters. Note that the ini- 
tial value of ARO points to h(N-1), and the initial value of AR1 points to x(0). 
Circular addressing is used in the TMS320C3x code for the FIR filter shown 
in Example 5-24. 



Figure 5-10. Data Structure for FIR Filters 



ARO 



Impulse Response 
h(N-1) 
h(N-2) 



Input Samples 
x(N-1) 
x(N-2) 



h(2) 
h(0) 



x(2) 

x(1) 
x(0) 



AR1 



Example 5-24. FIR Filter Code Using Circular Addressing 



Initialization 



LDI 
LDI 
LDI 



TOP LDF 
STF 

LDF 
LDF 



N,BK 

H,AR0 

X,AR1 



IN, R3 
R3, *ARl++% 



0,R0 
0,R2 



Filter 

RPTS 
MPYF3 
ADDF3 
ADDF 



; Load block size. 
; Load pointer to impulse response. 
;Load pointer to bottom of input 
; sample buffer. 

;Read input sample. 

; Store with other samples, 

;and point to top of buffer. 

; Initialize RO. 

; Initialize R2. 



N-l 

*AR0++% , *ARl++% , RO 



R0,R2,R2 
R0,R2 



; Repeat next instruction. 

; Multiply and accumulate. 
;Last product accumulated. 



STF 
B 



R2,Y 
TOP 



;Save result. 
; Repeat • 
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5.4 Bit-Reversed Addressing 

Bit-reversed addressing on the TMS320C3x enhances execution speed and 
program memory for FFT algorithms that use a variety of radices. The base 
address of bit-reversed addressing must be located on a boundary of the size 
of the table. For example, if IRO = 2 n ~ 1 , the n LSBs of the base address must 
be 0. The base address of the data in memory must be on a 2 n boundary. One 
auxiliary register points to the physical location of a data value. IRO specifies 
one-half the size of the FFT; that is, the value contained in IRO must be equal 
to 2 n ~ 1 , where n is an integer and the FFT size is 2 n . When you add IRO to the 
auxiliary register by using bit-reversed addressing, addresses are generated 
in a bit-reversed fashion. 

To illustrate this kind of addressing, assume eight-bit auxiliary registers. Let 
AR2 contain the value 01 1 0 0000 (96). This is the base address of the data in 
memory. Let IRO contain the value 0000 1 000 (8). Example 5-25 shows a se- 
quence of modifications of AR2 and the resulting values of AR2. 

Example 5-25. Bit-Reversed Addressing 



*AR2++(IR0)B 


; AR2 




0110 


0000 


(0th 


value ) 


*AR2++(IR0)B , 


; AR2 




0110 


1000 


(1st 


value ) 


*AR2++(IR0)B t 


• AR2 




0110 


0100 


(2nd 


value ) 


*AR2++(IR0)B , 


• AR2 


xs 


0110 


1100 


(3rd 


value ) 


*AR2++(IR0)B , 


AR2 




0110 


0010 


(4th 


value ) 


*AR2++(IR0)B j 


AR2 




0110 


1010 


(5th 


value ) 


*AR2++(IR0)B j 


AR2 




0110 


0110 


(6th 


value ) 


*AR2 j 


AR2 




0110 


1110 


(7th 


value ) 



Table 5-3 shows the relationship of the index steps and the four LSBs of AR2. 
You can find the four LSBs by reversing the bit pattern of the steps. 
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Table 5-3. Index Steps and Bit-Reversed Addressing 



Step 


Bit Pattern 


Bit-Reversed Pattern 


Bit-Reversed Step 


0 


0000 


0000 


0 


1 


0001 


1000 


8 


2 


0010 


0100 


4 


3 


0011 


1100 


12 


4 


0100 


0010 


2 


5 


0101 


1010 


10 


6 


0110 


0110 


6 


7 


0111 


1110 


14 


8 


1000 


0001 


1 


g 


1001 


1001 


9 


10 


1010 


0101 


5 


11 


1011 


1101 


13 


12 


1100 


0011 


3 


13 


1101 


1011 


11 


14 


1110 


0111 


7 


15 


1111 


1111 


15 
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5.5 System and User Stack Management 

The TMS320C3x provides a dedicated system stack pointer (SP) for building 
stacks in memory. The auxiliary registers can also be used to build a variety 
of more general linear lists. This section discusses the implementation of the 
following types of linear lists: 

□ Stack 

The stack is a linear list for which all insertions and deletions are made at 
one end of the list. 

□ Queue 

The queue is a linear list for which all insertions are made at one end of the 
list and all deletions are made at the other end. 

□ Dequeue 

The dequeue is a double-ended queue linear list for which insertions and 
deletions are made at either end of the list. 



5.5.1 System Stack Pointer 

The system stack pointer (SP) is a 32-bit register that contains the address of 
the top of the system stack. The system stack fills from low-memory address 
to high-memory address (see Figure 5-11). The SP always points to the last 
element pushed onto the stack. A push performs a preincrement, and a pop 
performs a postdecrement of the system stack pointer. 

The program counter is pushed onto the system stack on subroutine calls, 
traps, and interrupts. It is popped from the system stack on returns. The sys- 
tem stack can be pushed and popped using the PUSH, POP, PUSHF, and 
POPF instructions. 



Figure 5-11. System Stack Configuration 



Low Memory 



SP 



Bottom of Stack 



Top of Stack 



(Free) 



High Memory 



Addressing 5-31 



System and User Stack Management 



5.5.2 Stacks 

Stacks can be built from low to high memory or high to low memory. Two cases 
for each type of stack are shown. Stacks can be built using the preincrement/ 
decrement and postincrement/decrement modes of modifying the auxiliary 
registers (AR). Stack growth from high-to-low memory can be implemented in 
two ways: 

CASE 1 : Stores to memory using * — ARn to push data onto the stack and 
reads from memory using *ARn++ to pop data off the stack. 

CASE 2: Stores to memory using *ARn — to push data onto the stack and 
reads from memory using * ++ ARn to pop data off the stack. 

Figure 5-12 illustrates these two cases. The only difference is that in case 1 , 
the AR always points to the top of the stack, and in case 2, the AR always points 
to the next free location on the stack. 



Figure 5-12. Implementations of High-to-Low Memory Stacks 



ARn 



Case 1 
Low Memory 



(Free) 



Top of Stack 



Bottom of Stack 



High Memory 



ARn 



Case 2 
Low Memory 



(Free) 



Top of Stack 



Bottom of Stack 



High Memory 



Stack growth from low-to-high memory can be implemented in two ways: 

CASE 3: Stores to memory using *++ ARn to push data onto the stack and 
reads from memory using *ARn — to pop data off the stack. 

CASE 4: Stores to memory using *ARn++ to push data onto the stack and 
reads from memory using * — ARn to pop data off the stack. 

Figure 5-1 3 shows these two cases. In case 3, the AR always points to the top 
of the stack. In case 4, the AR always points to the next free location on the 
stack. 
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Figure 5-13. Implementations of Low-to-High Memory Stacks 



ARn -* 



Case 3 
Low Memory 



Bottom of Stack 



Top of Stack 



(Free) 



High Memory 



ARn -* 



Case 4 
Low Memory 



Bottom of Stack 



Top of Stack 



(Free) 



High Memory 



5.5.3 Queues 

A queue is like a FIFO. The implementation of queues is based on the manipu- 
lation of auxiliary registers. Two auxiliary registers are used: one to mark the 
front of the queue from which data is popped (or dequeued) and the other to 
mark the rear of the queue where data is pushed. With proper management 
of the auxiliary registers, the queue can also be circular. (A queue is circular 
when the rear pointer is allowed to point to the beginning of the queue memory 
after it has pointed to the end of the queue memory.) 
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The TMS320C3x provides a complete set of constructs that facilitate software 
and hardware control of the program flow. Software control includes repeats, 
branches, calls, traps, and returns. Hardware control includes operations, 
reset, and interrupts. Because programming includes a variety of constructs, 
you can select the one suited for your particular application. 

Several interlocked operations instructions provide flexible multiprocessor 
support and, through the use of external signals, a powerful means of 
synchronization. They also guarantee the integrity of the communication and 
result in a high-speed operation. 

The TMS320C3x supports a nonmaskable external reset signal and a number 
of internal and external interrupts. These functions can be programmed for a 



particular application. 

This chapter discusses the following major topics: 

Topic Page 

6.1 Repeat Modes .... 6-2 

6.2 Delayed Branches 6-8 

6.3 Calls, Traps, and Returns * . ...... . ........ 6-10 

6.4 Interlocked Operations 6-12 

6.5 Reset Operation . . 6-18 

6.6 Interrupts . 6-23 

6.7 TMS320LC31 Power Management Modes 6-36 
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6.1 Repeat Modes 

The repeat modes of the TMS320C3x can implement zero-overhead looping. 
For many algorithms, most execution time is spent in an inner kernel of code. 
Using the repeat modes allows these time-critical sections of code to be ex- 
ecuted in the shortest possible time. 

The TMS320C3x provides two instructions to support zero-overhead looping: 

□ RPTB (repeat a block of code) . RPTB repeats execution of a block of code 
a specified number of times. 

□ RPTS (repeat a single instruction) . RPTS fetches a single instruction once 
and then repeats its execution a number of times. Since the instruction is 
fetched only once, bus traffic is minimized. 

RPTB and RPTS are four-cycle instructions. These four cycles of overhead 
occur during the initial execution of the loop. All subsequent executions of the 
loop have no overhead (zero cycle). 

Three registers (RS, RE, and RC) are associated with the updating of the pro- 
gram counter (PC) when it is updated in a repeat mode. Table 6-1 describes 
these registers. 

Table 6-1. Repeat-Mode Registers 

Register Function 

RS Repeat Start Address Register. Molds the address of the first instruc- 
tion of the block of code to be repeated. 

RE Repeat End Address Register. Holds the address of the last instruc- 
tion of the block of code to be repeated. 

RC Repeat Count Register. Contains one less than the number of times 
the block remains to be repeated. For example, to execute a block 
N times, load N-1 into RC. 



For correct operation of the repeat modes, you must correctly initialize all of 
the above-mentioned registers. 
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6.1.1 Repeat-Mode Control Bits 

Two bits are important to the operation of RPTB and RPTS: 

□ RM bit. The repeat-mode flag (RM) bit in the status register specifies 
whether the processor is running in the repeat mode. 

■ RM = 0 indicates standard instruction fetching mode. 

■ RM = 1 indicates repeat-mode instruction fetches. 

□ S bit. The S bit is internal to the processor and cannot be programmed, 
but this bit is necessary to fully describe the operation of RPTB and RPTS. 

■ S = 0 indicates standard instruction fetches. 

■ S = 1 and RM = 1 indicates repeat-single instruction fetches. 



6.1 .2 Repeat-Mode Operation 

Information in the repeat-mode registers and associated control bits controls 
the modification of the PC during repeat-mode fetches. The repeat modes 
compare the contents of the RE register (repeat end address register) with the 
PC after the execution of each instruction. If they match and the repeat counter 
(RC) is nonnegative, the RC is decremented, the PC is loaded with the repeat 
start address, and the processing continues. The fetches and appropriate sta- 
tus bits are modified as necessary. Note that the RC is never modified when 
the RM flag is 0. 

The repeat counter should be loaded with a value one less than the number 
of times to execute the block; for example, an RC value of 4 would execute the 
block five times. The detailed algorithm for the update of the PC is shown in 
Example 6-1 . 



i 1 

Note: Maximum Number of Repeats 

The maximum number of repeats occurs when RC = 8000 OOOOh. This re- 
sults in 8000 0001 h repetitions. The minimum number of repeats occurs 
when RC = 0. This results in one repetition. 

RE should be greater than or equal to RS (RE * RS). Otherwise, the code 
will not repeat even though the RM bit remains set to 1 . 

By writing a 0 into the repeat counter or writing 0 into the RM bit of the status 
register, you can stop the repeating of the loop before completion. 
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Example 6-1. Repeat-Mode Control Algorithm 



if RM == 1 
if S == 1 

if first time through 
fetch instruction from memory 

else 

fetch instruction from IR 
RC - 1 RC 
if RC < 0 

0 -* ST(RM) 
0 -* S 

PC + 1 PC 
else if S == 0 

fetch instruction from memory 
if PC == RE 

RC - 1 -* RC 
if RC * 0 

RS -* PC 
else if RC < 0 

0 -> ST(RM) 

0 -* S 

PC + 1 — * PC 



If in repeat mode (RPTB or RPTS) 
If RPTS 

If this is the first fetch 

Fetch instruction from memory 

If not the first fetch 

Fetch instruction from IR 

Decrement RC 

If RC is negative 

Repeat single mode completed 

Turn off repeat -mode bit 

Clear S 

Increment PC 

If RPTB 

Fetch instruction from memory 

If this is the end of the block 

Decrement RC 

If RC is not negative 

Set PC to start of block 

If RC is negative 

Turn off repeat mode bits 

Clear S 

Increment PC 



6.1 .3 RPTB Instruction 

The RPTB instruction repeats a block of code a specified number of times. 

The number of times to repeat the block is the RC (repeat count) register value 
plus one. Because the execution of RPTB does not load the RC, you must load 
this register yourself. The RC register must be loaded before the RPTB instruc- 
tion is executed. A typical setup of the block repeat operation is shown in 
Example 6-2. 

Example 6-2. RPTB Operation 

LDI 15, RC ; Load repeat counter with 15 

RPTB ENDL00P ; Execute the block of code 

STLOOP ; from STLOOP to ENDLOOP 16 times 



ENDLOOP 
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Using the repeat-block mode of modifying the PC facilitates analysis of what 
would happen in the case of branches within the block. Assume that the next 
value of the PC will be either PC + 1 or the contents of the RS register. It is thus 
apparent that this method of block repeat allows much branching within the 
repeated block. Execution can go anywhere within the user's code via inter- 
rupts, subroutine calls, etc. For proper modification of the loop counter, the last 
instruction of the loop must be fetched. You can stop the repeating of the loop 
prior to completion by writing a 0 to the repeat counter or writing a 0 to the RM 
bit of the status register. 

6.1.4 RPTS Instruction 

An RPTS src instruction repeats the instruction following the RPTS src + 1 
times. Repeats of a single instruction initiated by RPTS are not interruptible, 
because the RPTS fetches the instruction word only once and then keeps it 
in the instruction register for reuse. An interrupt would cause the instruction 
word to be lost. Refetching the instruction word from the instruction register 
reduces memory accesses and, in effect, acts as a one-word program cache. 
If you need a single instruction that is repeatable and interruptible, you can use 
the RPTB instruction. 

When RPTS src is executed, the following sequence of operations occurs: 

1) PC + 1-*RS 

2) PC + 1 RE 

3) 1 RM status register bit 

4) 1 -> S bit 

5) src -* RC (repeat count register) 

The RPTS instruction loads all registers and mode bits necessary for the oper- 
ation of the single-instruction repeat mode. Step 1 loads the start address of 
the block into RS. Step 2 loads the end address into the RE (end address of 
the block). Since this is a repeat of a single instruction, the start address and 
the end address are the same. Step 3 sets the status register to indicate the 
repeat mode of operation. Step 4 indicates that this is the repeat single-instruc- 
tion mode of operation. Step 5 loads src into RC. 
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6.1.5 Repeat-Mode Restrictions 



Since the block repeat modes modify the program counter, other instructions 
cannot modify the program counter at the same time. There are two restric- 
tions: 

□ The last instruction in the block (or the only instruction in a block of 
size 1) cannot be a Bcond, BR, DBcond, CALL, CALLconcf, TRAP concf, 
RETIcond, RETScond, IDLE, RPTB, or RPTS. Example 6-3 shows an in- 
correctly placed standard branch. 

□ None of the last four instructions from the bottom of the block (or the only 
instruction in a block of size 1) can be a BcondD, BRD, or DB condD. 
Example 6-4 shows an incorrectly placed delayed branch. 

i 1 

Note: Rule Violation 

If either of these rules is violated, the PC will be undefined. 



Example 6-3. Incorrectly Placed Standard Branch 



LDI 
RPTB 



15, RC 
ENDLOOP 



STLOOP 



Load repeat counter with 15 

Execute the block of code 

from STLOOP to ENDLOOP 16 times 



ENDLOOP BR OOPS 

Example 6-4. Incorrectly Placed Delayed Branch 



LDI 
RPTB 



15, RC 
ENDLOOP 



STLOOP 



This branch violates rule 1 



Load repeat counter with 15 

Execute block of code 

from STLOOP to ENDLOOP 16 times 



BRD OOPS ; This branch violates rule 2 

ADDF 

MPYF 

ENDLOOP SUBF 

6.1 .6 RC Register Value After Repeat Mode Completes 

For the RPTB instruction, the RC register normally decrements to 0000 OOOOh 
unless the block size is 1 ; in that case, it decrements to FFFF FFFFh. However, 
if the RPTB instruction using a block size of 1 has a pipeline conflict in the 
instruction being executed, the RC register decrements to 0000 OOOOh. 
Example 6-5 illustrates a pipeline conflict. Refer to Chapter 9 for pipeline in- 
formation. 
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RPTS normally decrements the RC register to FFFF FFFFh. However, if the 
RPTS has a pipeline conflict on the last cycle, the RC register decrements to 

0000 OOOOh. 

1 i 
Note: Number of Repetitions 

In any case, the number of repetitions is always RC + 1 . 



Example 6-5. Pipeline Conflict in an RPTB Instruction 



EDC 



.word40000000h ; The program is located in 4000000Fh 



LDP EDC 
LDI @EDC, ARO 
LDI 15, RC 
RPTB ENDLOOP 
ENDLOOPLDI *AR0,R0 



Load repeat counter with 15 

Execute block of code 

The *AR0 read conflicts with 

the instruction fetching 

Then RC decrements to 0 

If cache is enabled, RC decrements 

to FFFF FFFFh 



6.1 .7 Nested Block Repeats 

Block repeats (RPTB) can be nested. Since the registers RS, RE, RC, and ST 
control the repeat-mode status, these registers must be saved and restored 
in order to nest block repeats. For example, if you write an interrupt service 
routine that requires the use of RPTB, it is possible that the interrupt asso- 
ciated with the routine may occur during repeated execution of a block. The 
interrupt service routine can check the RM bit to determine whether the block 
repeat mode is active. If this RM is set, the interrupt routine should save ST, 
RS, RE, and RC, in that order. The interrupt routine can then perform a block 
repeat. Before returning to the interrupted routine, the interrupt routine should 
restore RC, RE, RS, and ST, in that order. If the RM bit is not set, you don't need 
to save and restore these registers. 

The order in which the registers are saved/restored is important to guarantee 
correct operation. The ST register should be restored last, after the RC, RE, 
and RS registers. ST should be restored after restoring RC, because the RM 
bit cannot be set to 1 if the RC register is 0 or-1 . For this reason, if you execute 
a POP ST instruction (with ST (RM bit) = 1) while RC = 0, the POP instruction 
recovers all the ST register bits but not the RM bit that stays at 0 (repeat mode 
disabled). Also, RS and RE should be correctly set before you activate the re- 
peat mode. 

The RPTS instruction can be used in a block repeat loop if the proper registers 
are saved. 
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6.2 Delayed Branches 

The TMS320C3x offers three main types of branching: standard, delayed, and 
conditional delayed. 

Standard branches empty the pipeline before performing the branch; this 
guarantees correct management of the program counter and results in a 
TMS320C3x branch taking four cycles. Included in this class are repeats, 
calls, returns, and traps. 

Delayed branches on the TMS320C3x do not empty the pipeline, but rather 
guarantee that the next three instructions will execute before the program 
counter is modified by the branch. The result is a branch that requires only a 
single cycle, thus making the speed of the delayed branch very close to that 
of the optimal block repeat modes of the TMS320C3x. However, unlike block 
repeat modes, delayed branches may be used in situations other than looping. 
Every delayed branch has a standard branch counterpart that is used when 
a delayed branch cannot be used. The delayed branches of the TMS320C3x 
are BcondD, BRD, and DBcondD. 

Conditional delayed branches use the conditions that exist at the end of the 
instruction immediately preceding the delayed branch. They do not depend on 
the instructions following the delayed branch. The condition flags are set by 
a previous instruction only when the destination register is one of the exten- 
ded-precision registers (R0-R7) or when one of the compare instructions 
(CMPF, CMPF3, CMPI, CMPI3, TSTB, or TSTB3) is executed. Delayed 
branches guarantee that the next three instructions will execute, regardless 
of other pipeline conflicts. 

When a delayed branch is fetched, it remains pending until the three subse- 
quent instructions are executed. None of the three instructions that follow a 
delayed branch can be any of the following (see Example 6-6): 



Bcond 


DBcondD 


BcondD 


IDLE 


BR 


RETI cond 


BRD 


RETScond 


CALL 


RPTB 


CALLcond 


RPTS 


DBcond 


TRAP cond 



Delayed branches disable interrupts until the three instructions following the 
delayed branch are completed. This is independent of whether the branch is 
taken. 
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Note: Incorrect Use of Delayed Branches 

If delayed branches are used incorrectly the PC will be undefined, 
i i 

Example 6-6. Incorrectly Placed Delayed Branches 

Bl: BD LI 

NOP 
NOP 

B2: B L2 ; This branch is incorrectly placed. 

NOP 
NOP 
NOP 
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6.3 Calls, Traps, and Returns 

Calls and traps provide a means of executing a subroutine or function while 
providing a return to the calling routine. 

The CALL, CALLcond, and TRAP cond instructions store the value of the PC 
on the stack before changing the PC's contents. The stack thus provides a re- 
turn using either the RETScond or RETI cond instruction. 

□ The CALL instruction places the next PC value on the stack and places 
the src (source) operand into the PC. The src is a 24-bit immediate value. 
Figure 6-1 shows CALL response timing. 

□ The CALLcond instruction is similar to the CALL instruction (above) ex- 
cept for the following: 

■ It executes only if a specific condition is true (the 20 conditions — in- 
cluding unconditional — are listed in Table 10-9 on page 10-13). 

■ The src is either a PC-relative displacement or is in register-addres- 
sing mode. 

The condition flags are set by a previous instruction only when the destina- 
tion register is one of the extended-precision registers (R0-R7) or when 
one of the compare instructions (CMPF, CMPF3, CMPI, CMPI3, TSTB, or 
TSTB3) is executed. 

□ The TRAPcondinstruction also executes only if a specific condition is true 
(same conditions as for the CALLcond instruction). When executing, the 
following actions occur: 

1 ) Interrupts are disabled with 0 written to bit GIE of the ST. 

2) The next PC value is stored on the stack. 

3) A vector is retrieved from one of the addresses 20h to 3Fh and is 
loaded into the PC. 

The particular address is identified by a trap number in the instruction. 
Using the RETIcondto return re-enables interrupts. 

□ RETScond returns execution from any of the above three instructions by 
popping the top of the stack to the PC. To execute, the specified condition 
must be true. Conditions are the same as for the CALLcond instruction. 

□ RETIcond returns from traps or calls like the RETScond (above) with the 
addition that RETI cond also sets the GIE bit of the status register, which 
enables all interrupts whose enabling bit is set to 1 . Conditions are the 
same as for the CALLcond instruction. 
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Calls and traps accomplish the same functional task (that is, a subfunction is 
called and executed, and control is then returned to the calling function). Traps 
offer several advantages. Among them are the following: 

□ Interrupts are automatically disabled when a trap is executed. This allows 
critical code to execute without risk of being interrupted. Thus, traps are 
generally terminated with a RETI cond instruction to re-enable interrupts. 

□ You can use traps to indirectly call functions. This is particularly beneficial 
when a kernel of code contains the basic subf unctions to be used by appli- 
cations. In this case, the functions in the kernel can be modified and relo- 
cated without the need to recompile each application. 

Figure 6-1. CALL Response Timing 



Fetch CALL Decode CALL Read CALL Execute CALL Fetch First 

(Store PC . Subroutine 
I on Stack) I Instruction 
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6.4 Interlocked Operations 

Among the most common multiprocessing configurations is the sharing of 
global memory by multiple processors. In order for multiple processors to ac- 
cess this global memory and share data in a coherent manner, some sort of 
arbitration or handshaking is necessary. This requirement for arbitration is the 
purpose of the TMS320C3x interlocked operations. 

The TMS320C3x provides a flexible means of multiprocessor support with five 
instructions, referred to as interlocked operations. Through the use of external 
signals, these instructions provide powerful synchronization mechanisms. 
They also guarantee the integrity of the communication and result in a high- 
speed operation. The interlocked-operation instruction group is listed in 
Table 6-2. 

Table 6-2. Interlocked Operations 



Mnemonic 


Description 


Operation 


LDFI 


Load floating-point value into a register, 
interlocked 


Signal interlocked 
src dst 


LDII 


Load integer into a register, interlocked 


Signal interlocked 
src dst 


SIQI 


Signal, interlocked 


Signal interlocked 
Clear interlock 


STFI 


Store floating-point value to memory, 
interlocked 


src -* dst 
Clear interlock 


STII 


Store integer to memory, interlocked 


src -* dst 
Clear interlock 



The interlocked operations use the two external flag pins, XFO and XF1 . XFO 
must be configured as an output pin; XF1 is an input pin. When configured in 
this manner, XFO signals an interlock operation request, and XF1 acts as an 
acknowledge signal for the requested interlocked operation. In this mode, XFO 
and XF1 are treated as active-low signals. 



The external timing for the interlocked loads and stores is the same as for stan- 
dard loads and stores. The interlocked loads and stores may be extended like 
stand ard accesses by using the appropriate ready signal (RDYj n t or XRDYj nt ) . 
(RDYj n t and XRDY, nt are a combination of external ready input and software 
wait states. Refer to Chapter 7, External Bus Operation, for more information 
on ready generation.) 
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The LDFI and LDII instructions perform the following actions: 

1 ) Simultaneously set XFO to 0 and begin a read cycle. The timing of XFO is 
similar to that of the address bus during a read cycle. 

2) Execute an LDF or LDI instruction and extend the read cycle until XF1 is 
set to 0 and a ready (RDYj nt or XRDYj nt ) is signaled. 

3) Leave XFO set to 0 and end the read cycle. 

The read/write operation is identical to any other read/write cycle except for 
the special use of XFO and XF1 . The src operand for LDFI and LDII is always 
a direct or indir ect memory add res s. XFO is set to 0 only if the src is located 
off-chip; that is, STRB, MSTRB, or IOSTRB is active, or the src is one of the 
on-chip peripherals. If on-chip memory is accessed, then XFO is not asserted, 
and the operation is as an LDF or LDI from internal memory. 

The STFI and STII instructions perform the following operations: 

1 ) Simultaneously set XFO to 1 and begin a write cycle. The timing of XFO is 
similar to that of the address bus during a write cycle. 

2) Execu te an STF o r STI instruction and extend the write cycle until a ready 
(RDY int or XRDY int ) is signaled. 

As in the case for LDFI and LDII, th e dst of STF I and STII affects XFO. If dst 
is located off-chip (STRB, MSTRB, or IOSTRB is active) or the dst is one of 
the on-chip peripherals, XFO is set to 1 . If on-chip memory is accessed, then 
XFO is not asserted and the operations are as an STF or STI to internal 
memory. 

The SIGI instruction functions as follows: 

1) Sets XFO toO. 

2) Idles until XF1 is set toO. 

3) Sets XFO to 1 and ends the operation. 

While the LDFI, LDII, and SIGI instructions are waiting for XF1 to be set to 0, 
you ca n interrupt them. LDFI and LDII require a ready signal (RDYj n t or' 
XRDYj nt ) in order to be interrupted. Because interrupts are taken on bus cycle 
boundaries (see Section 6.6), an interrupt may be taken any time after a valid 
ready. This allows you to implement protection mechanisms against deadlock 
conditions by interrupting an interlocked load that has taken too long. Upon re- 
turn from the interrupt, the next instruction is executed. The STFI and STII 
instructions are not interruptible. Since the STFI and STII instructions com- 
plete when ready is signaled, the delay until an interrupt can occur is the same 
as for any other instruction. 
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Interlocked operations can be used to implement a busy-waiting loop, to 
manipulate a multiprocessor counter, to implement a simple semaphore 
mechanism, or to perform synchronization between two TMS320C3xs. The 
following examples illustrate the usefulness of the interlocked operations in- 
structions. 

Example 6-7 shows the implementation of a busy-waiting loop. If location 
LOCK is the interlock for a critical section of code, and a nonzero means the 
lock is busy, the algorithm for a busy-waiting loop can be used as shown. 



Example 6-7. Busy-Waiting Loop 



LDI 1,R0 

LI: LDII 8L0CK,R1 

STII R0,§LOCK 

BNZ LI 



Put 1 into RO 

Interlocked operation begun 
Contents of LOCK Rl 
Put RO (- 1) into LOCK, XFO 
Interlocked operation ended 
Keep trying until LOCK - 0 



Example 6-8 shows how a location COUNT may contain a count of the num- 
ber of times a particular operation needs to be performed. This operation may 
be performed by any processor in the system. If the count is 0, the processor 
waits until it is nonzero before beginning processing. The example also shows 
the algorithm for modifying COUNT correctly. 



Example 6-8. Multiprocessor Counter Manipulation 



CT: OR 4,I0F 

LDII §COUNT,Rl 

BZ CT 

SUBI 1,R1 



Rl,( 



XFO « 1 

Interlocked operation ended 
Interlocked operation begun 
Contents of COUNT -* Rl 
If COUNT - 0, keep trying 
Decrement Rl (= COUNT) 
Update COUNT , XFO ■ 1 
Interlocked operation ended 



Figure 6-2 illustrates multiple TMS320C3xs sharing global memory and using 
the interlocked instructions as in Example 6-9, Example 6-10, and 
Example 6-11. 
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Figure 6-2. Multiple TMS320C3xs Sharing Global Memory 



Global Memory 



Arbitration Logic 



Lock, Count, or S 



XFO XF1 
TMS320C3x#1 



(X)A 



OOP 



CTRL 



Local 
Memory 



(X)A 



CTRL 



XFO XF1 
TMS320C3X #2 



Local 
Memory 



It might sometimes be necessary for several processors to access some 
shared data or other common resources. The portion of code that must access 
the shared data is called a critical section. 

To ease the programming of critical sections, semaphores may be used. 
Semaphores are variables that can take only non-negative integer values. 
Two primitive, indivisible operations are defined on semaphores (with S being 
a semaphore): 

V(S): s + 1 s 

P(S): P: if (S == 0), go to P 
else S — 1 -* S 

Indivisibility of V(S) and P(S) means that when these processes access and 
modify the semaphore S, they are the only processes accessing and modify- 
ing S. 

To enter a critical section, a P operation is performed on a common sema- 
phore, say S (S is initialized to 1). The first processor performing P(S) will be 
able to enter its critical section. All other processors are blocked because S 
has become 0. After leaving its critical section, the processor performs a V(S), 
thus allowing another processor to execute P(S) successfully 
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The TMS320C3x code for V(S) is shown in Example 6-9; code for P(S) is 
shown in Example 6-10. Compare the code in Example 6-10 to the code in 
Example 6-8. 



Example 6-9. Implementation of V(S) 

V; LDII 8S,R0 
ADD I 1,R0 

stii R0,es 

Example 6-10. Implementation of P(S) 

P; OR 4,IOF 

NOP 

LDII @S,R0 

BZ P 
SUBI 1,R0 
STII R0,@S 



Interlocked read of S begins (XFO = 0) 

Contents of S — > R0 

Increment R0 ( = S ) 

Update S, end interlock (XFO = 0) 



End interlock (XFO - 1) 

Avoid potential pipeline conflicts when 

executing out of cache, on-chip memory 

or zero wait-state memory 

Interlocked read of S begins 

Contents of S — * R0 

If S = 0, go to P and try again 

Decrement R0 (= S) 

Update S, end interlock (XFO = 1) 



The SIGI operation can synchronize, at an instruction level, multiple 
TMS320C3xs. Consider two processors connected as shown in Figure 6-3. 
The code for the two processors is shown in Example 6-11. 



Figure 6-3. Zero-Logic Interconnect of TMS320C3xs 



TMS320C3x#1 




TMS320C3X #2 


XFO 
XF1 


► 


XF1 


-« — 




XFO 



Processor #1 runs until it executes the SIGI. It then waits until processor #2 
executes a SIGI. At this point, the two processors have synchronized and con- 
tinue execution. 
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Example 6-11. Code to Synchronize Two TMS320C3xs at the Software Level 



Time Code for TMS320C3x #1 
0 • 



SIGI 



Code for TMS320C3x #2 



(WAIT) 



- Synchronization Occurs - 



♦ SIGI 



Program Flow Control 6-1 7 



Reset Operation 



6.5 Reset Operation 

The TMS320C3x supports a nonmaskable external reset signal (RESET), 
which is used to perform system reset. This section discusses the reset opera- 
tion. 

At powerup, the state of the TMS320C3x processor is undefined. You can use 
the RESET signal to place the processor in a known state. This signal must 
be asserted low for ten or more H1 clock cycles to guarantee a system reset. 
H1 is an output clock signal generated by the TMS320C3x (see Chapter 13 
for more information). 

Reset affects the other pins on the device in either a synchronous or asynchro- 
nous manner. The synchronous reset is gated by the TMS320C3x's internal 
clocks. The asynchronous reset directly affects the pins and is faster than the 
synchronous reset. Table 6-3 shows the state of the TMS320C3x's pins after 
RESET = 0. Each pin is described according to whether the pin is reset syn- 
chronously or asynchronously. 
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Table &-3. Pin Operation at Reset 



Signal 


#Pins 


Operation at Reset 






Primary Interface (61 Pins) 


D31-D0 


32 


Synchronous reset; placed in high-impedance state 


A23-A0 


24 


Synchronous reset; placed in high-impedance state 


R/W 


1 


Synchronous reset; deasserted by going to a high level 


STRB 


1 


Synchronous reset; deasserted by going to a high level 


RDY 


1 


Reset has no effect. 


HOLD 


1 


Reset has no effect. 


HOLDA 


1 


Reset has no effect. 






Expansion Interface (49 Pins)t 


XD31 -XDO 


32 


Synchronous reset; placed in high-impedance state 


XA12-XA0 


13 


Synchronous reset; placed in high-impedance state 


XR/W 


1 


Synchronous reset; placed in high-impedance state 


MSTRB 


1 


Synchronous reset; deasserted by going to a high level 


IOSTRB 


1 


Synchronous reset; deasserted by going to a high level 


XRDY 


1 


Reset has no effect. 






wuiiirui wiyfiaio \9 ntt&j 


RESET 


1 


Reset input pin 


INT3-INT0 


4 


Reset has no effect. 


IACK 


1 


Synchronous reset; deasserted by going to a high level 


MC/MP or 


1 


Reset has no effect. 


MCBL/MP 






XF1-XF0 


2 


Asynchronous reset; placed in high-impedance state 



t Present only on TMS320C30 
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Table 6-3. Pin Operation at Reset (Continued) 



Signal 


#Pins 


Operation at Reset 






Serial Port 0 Signals (6 Pins) 


CLKXO 


1 


Asynchronous reset; placed in high-impedance state 


DXO 


1 


Asynchronous reset; placed in high-impedance state 


FSXO 




Asynchronous reset; placed in high-impedance state 


CLKRO 


1 


Asynchronous reset; placed in high-impedance state 


DRO 


1 


Asynchronous reset; placed in high-impedance state 


FSRO 


1 


Asynchronous reset; placed in high-impedance state 
Serial Port 1 Signals (6 Pins) t 


CLKX1 


1 


Asynchronous reset; placed in high-impedance state 


DX1 




Asynchronous reset; placed in high-impedance state 


FSX1 


1 


Asynchronous reset; placed in high-impedance state 


CLKR1 


1 


Asynchronous reset; placed in high-impedance state 


DR1 




Asynchronous reset; placed in high-impedance state 


FSR1 


1 


Asynchronous reset; placed in high-impedance state 
Timer 0 Signal (1 Pin) 


TCLKO 


1 


Asynchronous reset; placed in high-impedance state 
Timer 1 Signal (1 Pin) 


TCLK1 


1 


Asynchronous reset; placed in high-impedance state 
Supply and Oscillator Signals (29 Pins) 


V D D 0-0) 


4 


Reset has no effect. 


IODV DD (1,0) 


2 


Reset has no effect. 


ADV DD (1,0) 


2 


Reset has no effect. 


PDV DD 


1 


Reset has no effect. 


DDV DD (1,0) 


2 


Reset has no effect. 


MDV DD 


1 


Reset has no effect. 


V S S 0-0) 


4 


Reset has no effect. 


t Present only on TMS320C30 
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Table &-3. Pin Operation at Reset (Continued) 



Signal 


#Pins 


Operation at Reset 


DV SS (3-0) 


2 


Reset has no effect. 


CV SS (1 ,0) 


2 


Reset has no effect. 


'Vss 




Reset has no effect. 


v B bp 




Reset has no effect. 


SUBS 




Reset has no effect. 


X1 




Reset has no effect. 


yp/CLKIN 

A£/vLr\lli 




Docot Hoc nn offor*t 
noocri lido iiu 01101*1. 


H1 
n i 




Q\/nr»hronni ic roc of \A/ill e\r\ \f\ itc initial ctoto u/han DCQCT moLrAC a 1 \f\ f\ 
oyiiui iiuiiuuo icooi. win yu iu ilo milieu olcut? wiioii riC-OC. I iiicMsuo a I iqu 

transition. See Chapter 13. 


H3 


1 


Synchronous reset. Will go to its initial state when RESET makes a 1 to 0 
transition. See Chapter 13. 

■■■ ■ ■ f mm m m mm. m M ^ m± % 

Emulation, Test, and Reserved (18 Pins) 


EMUO 


1 


Undefined 


EMU1 


1 


Undefined 


EMU2 


1 


Undefined 


EMU3 


1 


Undefined 


EMU4/SHZ 


1 


Undefined 


EMU5t 


1 


Undefined 


EMU6t 


1 


Undefined 


RSVOt 


1 


Undefined 


RSVlt 


1 


Undefined 


RSV2t 


1 


Undefined 


RSV3t 


1 


Undefined 


RSV4t 


1 


Undefined 


RSV5t 


1 


Undefined 


RSV6' 




Undefined 


RSV7t 




Undefined 


RSV8t 




Undefined 


RSV9t 




Undefined 


RSVIOt 




Undefined 



t Present only on TMS320C30 
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At system reset, the following additional operations are performed: 

□ The peripherals are reset. This is a synchronous operation. The peripheral 
reset is described in Chapter 8. 

□ The external bus control registers are reset. The reset values of the control 
registers are described in Chapter 7. 

□ The following CPU registers are loaded with 0: 

■ ST (CPU status register) 

■ IE (CPU/DMA interrupt enable flags) 

■ IF (CPU interrupt flags) 

■ IOF (I/O flags) 

□ The reset vector is read from memory location Oh and loaded into the PC. 
This vector contains the start address of the system reset routine. 

□ Execution begins. Refer to Example 1 1-1 on page 1 1 -3 for an illustration 
of a processor initialization routine. 

Multiple TMS320C3xs driven by the sa me syste m clock may be reset and syn- 
chronized. When the 1 to 0 transition of RESET occurs, the processor is placed 
on a well-defined internal phase, and all of the TMS320C3xs will come up on 
the same internal phase. 

Unless otherwise specified, all registers are undefined after reset. 
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6.6 Interrupts 

The TMS320C3x supports multiple internal and external interrupts, which can 
be used for a variety of applications. This section discusses the operation of 
these interrupts. 

A functional diagram of the logic used to implement the external interrupt 
inputs is shown in Figure 6-4; the logic for internal interrupts is similar. Addi- 
tional information regarding internal interrupts can be found in Chapter 8. 



Figure 6-4. Interrupt Logic Functional Diagram 
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External interrupts are synchronized internally, as illustrated by the three flip- 
flops clocked by H1 and H3. Once synchronized, the interrupt input will set the 
corresponding interrupt flag register (IF) bit if the interrupt is active. 

External interrupts are latched internally on the falling edge of H 1 (see Chapter 
13 for timing information). An external interrupt must be held low for at least 
one H1/H3 cycle to be recognized by the TMS320C3x. Interrupts should be 
held low for only one or two H1 falling edges. If the interrupt is held low for three 
or more H1 falling edges, multiple interrupts may be recognized. 



6.6.1 Interrupt Vector Table 

Table 6-4 and Table 6-5 contain the interrupt vectors. In the microprocessor 
mode of the TMS320C30 and the TMS320C31 (Table 6-4) and the microcom- 
puter mode of the TMS320C31 (Table 6-5), the interrupt vectors contain the 
addresses of interrupt service routines that should start executing when an in- 
terrupt occurs. On the other hand, in the microcomputer/boot loader mode of 
the TMS320C31 , the interrupt vector contains a branch instruction to the start 
of the interrupt service routine. 
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Table 6-4. Reset, Interrupt, and Trap-Vector Locations for the TMS320C30/TMS320C31 
Microprocessor Mode 



Ad dross 


Routine 


OOh 


RESET 


01 h 


INTO 


02h 


INT1 


03h 


INT2 


04h 


INT3 


05h 


XINTO 


06h 


RINTO 


07h 


XINT1 * 


Uon 


PIMT1 1 


09h 


TINTO 


OAh 


TINT1 


OBh 


DINT 


OCh 




Reserved 


1Fh 




20h 


TRAPO 
• 

• 


3Bh 


• 

TRAP 27 


3Ch 


TRAP 28 (Reserved) 


3Dh 


TRAP 29 (Reserved) 


3Eh 


TRAP 30 (Reserved) 


3Fh 


TRAP 31 (Reserved) 



t Reserved on TMS320C31 
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Table 6-5. Reset, Interrupt, and Trap Vector Locations for the TMS320C31 Microcomputer 
Boot Mode 



Ad dross 


Daficr 1 ntion 

WOwl IIIIIVII 


809FC1 


TnTo 


809FC2 


INT1 


809FC3 


INT2 


809FC4 


INT3 


809FC5 


XINTO 


809FC6 


RINTO 


809FC7 


Reserved 


809FC8 


Reserved 


809FC9 


TINTO 


809FCA 


TINT1 


809FCB 


DiNTO 


809FCC-809FDF 


Reserved 


809FE0 


TRAPO 


809FE1 
• 
• 


TRAP1 
• 


• 

809FFB 


• 
• 

TRAP27 


809FFC-809FFF 


Reserved 



6.6.2 Interrupt Prioritization 

When two interrupts occur in the same clock cycle or when two previously 
received interrupts are waiting to be serviced, one interrupt will be serviced be- 
fore the other. The CPU handles this prioritization by servicing the interrupt 
with the least priority. Table 6-6 shows the priorities assigned to the reset and 
interrupt vectors. 

The CPU controls all prioritization of interrupts (see Table 6-6 for reset and in- 
terrupt vector locations and priorities). 
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Table 6-6. Reset and Interrupt Vector Priorities 



Reset or 
Interrupt 


Vector 
Location 


Priority 


Function 


DCQCT 

ntot 1 


un 


u 


txiernai resei signal mpui on me ntot i pin 


IIN 1 U 


1 h 

i n 


1 


txiernai inierrupi on in© iin i u pin 


IMT1 


Oh 


o 


txiernai inierrupi on ine iin i i pin 


IIN I c. 


on 


Q 
O 


txiernai inierrupi un me iin \ c. pin 


INT** 


Ah 


A 

*T 


external inierrupi un me iin i o pin 


AIIN 1 U 


Oil 


c 

o 


internal inierrupi yeneraiea wnen seriai-pon u iransmn Duner is empiy 


HUN 1 U 


on 


O 


internal inierrupi generated wnen serial-pun u receive uuner is tuii 


YINT1 1 

✓VI IN 1 1 ■ 


7h 

/ 1 1 


7 


Intornol intcurnmf nanarataH v*/h»c*n carlo LnArt 1 trancmit Hirffar* ie amr\H/ 

luminal iinefiupi yeiieraieu wiieii oeiiai-puri i iranoMiii uuner io enipiy 


RINTlt 


8h 


8 


Internal interrupt generated when serial-port 1 receive buffer is full 


TINTO 


9h 


9 


Internal interrupt generated by timer 0 


TINT1 


OAh 


10 


Internal interrupt generated by timer 1 


DINT 


OBh 


11 


Internal interrupt generated by DMA controller 0 



t Reserved on TMS320C31 



6.6.3 Interrupt Control Bits 

Four CPU registers contain bits used to control interrupt operation: 

□ Status Register (ST) 

The CPU global interrupt enable bit (GIE) located in the CPU status regis- 
ter (ST) controls all maskable CPU interrupts. When this bit is set to 1 , the 
CPU responds to an enabled interrupt. When this bit is cleared to 0, all 
CPU interrupts are disabled. Refer to subsection 3.1.7 on page 3-4 for 
more information. 

□ CPU/DMA Interrupt Enable Register (IE) 

This register individually enables/disables CPU and DMA (external, serial 
port, and timer) interrupts. Refer to subsection 3.1 .8 on page 3-7 for more 
information. 

□ CPU Interrupt Flag Register (IF) 

This register contains interrupt flag bits that indicate the corresponding in- 
terrupt is set. Refer to subsection 3.1 .9 on page 3-9 for more information. 
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□ DMA Global Control Register 

Interrupts to the DMA are controlled by the synchronization bits of the 
DMA global control register. DMA interrupts are independent of the ST 
(GIE) bit. 

Interrupt Flag Register Behavior 

When an external interrupt occurs, the corresponding bit of the IF register is 
set to 1 . When the CPU or DMA controller processes this interrupt, the corre- 
sponding interrupt flag bit is cleared by the internal interrupt acknowledge sig- 
nal. It should be noted, however, that if INTn is still low when the interrupt ac- 
knowledge signal occurs, the interrupt flag bit will be cleared for only one cycle 
and then set again because INTn is still low. Accordingly, it is theoretically pos- 
sible that, depending on when the IF register is read, this bit may be 0 even 
though INTn is 0. When the TMS320C3x is reset, 0 is written to the interrupt 
flag register, thereby clearing all pending interrupts. 

The interrupt flag register bits may be read and written under software control. 
Writing a 1 to an IF register bit sets the associated interrupt flag to 1 . Similarly, 
writing a 0 resets the corresponding interrupt flag to 0. In this way, all interrupts 
may be triggered and/or cleared through software. Since the interrupt flags 
may be read, the interrupt pins may be polled in software when an interrupt-dri- 
ven interface is not required. 

Internal interrupts operate in a similar manner. In the IF register, the bit corre- 
sponding to an internal interrupt may be read and written through software. 
Writing a 1 sets the interrupt latch; writing a 0 clears it. All internal interrupts 
are one H1/H3 cycle in length. 

The CPU global interrupt enable bit (GIE), located in the CPU status register 
(ST), controls all CPU interrupts. All DMA interrupts are controlled by the DMA 
global interrupt enable bit, which is not dependent on ST(GIE) and is local to 
the DMA. The DMA global interrupt enable bit is dependent, in part, on the 
state of the DMA SYNC bits. It is not directly accessible through software (see 
Chapter 8). The AND of the interrupt flag bit and the interrupt enables is then 
connected to the interrupt processor. 

6.6.4 Interrupt Processing 

The 'C3x allows the CPU and DMA coprocessor to respond to and process in- 
terrupts in parallel. Figure 6-5 on page 6-28 shows interrupt processing flow; 
for exact sequence, refer to Table 6-7 on page 6-29. 
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Figure 6-5. Interrupt Processing 




If Enabled, 
Interrupt Is 
a CPU Interrupt 



If Enabled, 
Interrupt Is 
a DMA Interrupt 



Disable Interrupts 
GIE«-0 



Clear Interrupt Flag 



Clear Interrupt Flag 



DMA Proceeds According 
to SYNC Bits 



PC -* *(++SP) 



DMA Continues 



Complete All Fetched Instructions 



PC «- Interrupt Vector 



i 



CPU Starts Executing ISR Routine 



Note: CPU and DMA Interrupts 

CPU and DMA interrupts are acknowledged (responded to by the CPU) on 
instruction fetch boundaries only. If instruction fetches are halted because 
of pipeline conflicts or execution of RPTS loops, CPU and DMA interrupts are 
not acknowledged until instruction fetching continues. 
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Table 6-7. Interrupt Latency 



Cycle 


Description 


Fetch 


Decode 


Read 


Execute 


1 


Recognize interrupt in single-cycle fetched 
(prog a+1) instruction. 


prog 
a + 1 


prog a 


prog a-1 


prog a-2 


2 


Temporarily disable interrupt until QIE is cleared. 




interrupt 


prog a 


prog sh1 


3 


Read the interrupt vector table. 






interrupt 


prog a 


4 


Clear Interrupt flag; clear GIE bit; store return address 
to stack. 









interrupt 


5 


Pipeline begins to fill with ISR instruction. 


isr1 








6 


Pipeline continues to fill with ISR instruction. 


isr2 


isr1 






7 


Pipeline continues to fill with ISR instruction. 


isr3 


isr2 


isr1 




8 


Execute first instruction of interrupt service routine. 


isr4 


isr3 


isr2 


isrl 



In the CPU interrupt processing cycle (left side of Figure 6-5), the correspond- 
ing interrupt flag in the IF register is cleared, and interrupts are globally dis- 
abled (GIE = 0). The CPU completes all fetched instructions. The current PC 
is pushed to the top of the stack. The interrupt vector is fetched and loaded into 
the PC, and the CPU starts executing the first instruction in the interrupt ser- 
vice routine (ISR). 

If you wish to make the interrupt service routine interruptible, you can set the 
GIE bit to 1 after entering the ISR. 

The DMA interrupt processing cycle (right side of Figure 6-5) is similar to that 
of the CPU. After the pertinent interrupt flag is cleared, the DMA coprocessor 
proceeds according to the status of the SYNC bits in the DMA coprocessor 
global control register. 

The interrupt acknowledge (IACK) instruction can be used to signal externally 
that an interrupt has b een s erviced. If external memory is specified in the oper- 
and, IACK drives the IACK pin and performs a dummy read. The read is per- 
formed from the address specified by the IACK instruction operand. IACK is 
typically placed in the early portion of an interrupt service routine. However, 
it may be better suited at the end of the interrupt service routine or be totally 
unnecessary. 

Note the following: 

□ Interrupts are disabled during an RPTS and during a delayed branch (until 
the three instructions following a delayed branch are completed). Inter- 
rupts are held until after the branch. 
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□ When an interrupt occurs, instructions currently in the decode and read 
phases continue regular execution. This is not the case for an instruction 
in the fetch phase: 

■ If the interrupt occurs in the first cycle of the fetch of an instruction, the 
fetched instruction is discarded (not executed), and the address of 
that instruction is pushed to the top of the system stack. 

■ If the interrupt occurs after first cycle of the fetch (in the case of a multi- 
cycle fetch due to wait states), that instruction is executed, and the ad- 
dress of the next instruction to be fetched is pushed to the top of the 
system stack. 



6.6.5 CPU Interrupt Latency 

CPU interrupt latency, defined as the time from the acknowledgement of the 
interrupt to the execution of the first interrupt service routine (ISR) instruction, 
is at least eight cycles. This is explained in Table 6-7 on page 6-29, where the 
interrupt is treated as an instruction. It assumed that all of the instructions are 
single-cycle instructions. 



6.6.6 CPU/DMA Interaction 

If the DMA is not using interrupts for synchronization of transfers, it will not be 
affected by the processing of the CPU interrupts. Detected interrupts are re- 
sponded to by the CPU and DMA on instruction fetch boundaries only. Since 
instruction fetches are halted due to pipeline conflicts or when executing 
instructions in an RPTS loop, interrupts will not be responded to until instruc- 
tion fetching continues. It is therefore possible to interrupt the CPU and DMA 
simultaneously with the same or different interrupts and, in effect, synchronize 
their activities. For example, it may be necessary to cause a high-priority DMA 
transfer that avoids bus conflicts with the CPU (that is, that makes the DMA 
higher priority than the CPU). This may be accomplished by using an interrupt 
that causes the CPU to trap to an interrupt routine that contains an IDLE 
instruction. Then if the same interrupt is used to synchronize DMA transfers, 
the DMA transfer counter can be used to generate an interrupt and thus return 
control to the CPU following the DMA transfer. 

Since the DMA and CPU share the same set of interrupt flags, the DMA may 
clear an interrupt flag before the CPU can respond to it. For example, if the 
CPU interrupts are disabled, the DMA can respond to interrupts and thus clear 
the associated interrupt flags. 
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6.6.7 TMS320C3x Interrupt Considerations 

Give careful consideration to TMS320C3x interrupts, especially if you make 
modifications to the status register when the global interrupt enable (GIE) bit 
is set. This can result in the GIE bit being erroneously set or reset as described 
in the following paragraphs. 

The GIE bit is set to 0 by an interrupt. This can cause a processing error if any 
code following within two cycles of the interrupt recognition attempts to read 
or modify the status register. For example, if the status register is being pushed 
onto the stack, it will be stored incorrectly if an interrupt was acknowledged two 
cycles before the store instruction. 

When an interrupt signal is recognized, the TMS320C3x continues executing 
the instructions already in the read and decode phases in the pipeline. Howev- 
er, because the interrupt is acknowledged, the GIE bit is reset to 0, and the 
store instruction already in the pipeline will store the wrong status register 
value. 



For example, if the program is like this: 



NOP 

interrupt recognized — >LDI §V_ADDR, ARl 

MPYI *AR1, RO 

PUSH ST 

POP ST 



the PUSH ST instruction will save the ST contents in memory, which includes 
GIE = 0. Since the device is expected to have GIE = 1 , the POP ST instruction 
will put the wrong status register value into the ST. 

A similar situation may occur if the GIE bit = 1 and an instruction executes that 
is intended to modify the other status bits and leave the GIE bit set. In the 
above example, this erroneous setting would occur if the interrupt were recog- 
nized two cycles before the POP ST instruction. In that case, the interrupt 
would clear the GIE bit, but the execution of the POP instruction would set the 
GIE bit. Since the interrupt has been recognized, the interrupt service routine 
will be entered with interrupts enabled, rather than disabled as expected. 

One solution is to use traps. For example, you can use TRAP 0 to reset GIE 
and use TRAP 1 to set GIE. This is accomplished by making TRAP 0 and 
TRAP 1 be the instructions RETS and RETI, respectively. 
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PUSH 

LDI 

NOP 

NOP 

AND 

POP 



Another alternative incorporates the following code fragment, which protects 
against modifying or saving of the status register by disabling interrupts 
through the interrupt enable register: 



IE 

0, IE 



ODFFFh, ST 

IE 



Save IE register 
Clear IE register 



Set GIE = 0 



Added instructions to 

avoid pipeline problems 

2 NOPs or useful instructions 

Instruction that reads or 
writes to ST register. 
Added instruction 
to avoid pipeline 
problems. 



6.6.8 TMS320C30 Interrupt Considerations 

The TMS320C30 has two unique exceptions to the interrupt operation. 

□ The status register global interrupt enable (GIE) bit may be erroneously 
reset to 0 (disabled setting) if all of the following conditions are true: 

■ A conditional trap instruction (TRAPcond) has been fetched, 

■ The condition for the trap is false, and 

■ A pipeline conflict has occurred, resulting in a delay in the decode or 
read phases of the instruction. 

During the decode phase of a conditional trap, interrupts are temporarily 
disabled to ensure that the trap will execute before a subsequent interrupt. 
If a pipeline conflict occurs and causes a delay in execution of the condi- 
tional trap, the interrupt disabled condition may become the last known 
condition of the GIE bit. In the case that the trap condition is false, inter- 
rupts will be permanently disabled until the GIE bit is intentionally set. The 
condition does not present itself when the trap condition is true, because 
normal operation of the instruction causes the GIE to be reset, and stan- 
dard coding practice will set the GIE to 1 before the trap routine is exited. 
Several instruction sequences that can cause pipeline conflicts have been 
found: 

■ LDI mem,SP 
TRAPcond n 

■ LDI mem,SP 
NOP 

TRAPcond n 
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■ STI 



SP,mem 



TRAPcond 



n 



■ STI 

LDI 



Rx , * ARy 
*ARx,Ry 
*ARz,Rw 



I I LDI 



TRAPcond n 

Other similar conditions may also cause a delay in the execution. There- 
fore, the following solution is recommended to avoid or rectify the problem. 

Insert two NOP instructions immediately prior to the TRAPcond instruc- 
tion. One NOP is insufficient in some cases, as illustrated in the second 
bulleted item, above. This eliminates the opportunity for any pipeline con- 
flicts in the immediately preceding instructions and enables the conditional 
trap instruction to execute without delays. 

□ Asynchronous accesses to the interrupt flag register (IF) can cause the 
TMS320C3x to fail to recognize and service an interrupt. This may occur 
when an interrupt is generated and is ready to be latched into the IF regis- 
ter on the same cycle that the IF is being written to by the CPU. Note that 
logic operations (AND, OR, XOR) may write to the IF register. 

The logic currently gives the CPU write priority; consequently, the as- 
serted interrupt might be lost. This is particularly true if the asserted inter- 
rupt has been generated internally (for example, a direct memory access 
(DMA) interrupt). This situation can arise as a result of a decision to poll 
certain interrupts or a desire to clear pending interrupts due to a long pulse 
width. In the case of a long pulse width, the interrupt may be generated 
after the CPU responds to the interrupt and attempts to automatically clear 
it by the interrupt vector process. 

The recommended solution is not to use the interrupt polling technique but 
to design the external interrupt inputs to have pulse widths of between 1 
and 2 instruction cycles. The alternative to strict polling is to periodically 
enable and disable the interrupts that would be polled, thereby allowing 
the normal interrupt vectoring to take place; that automatically clears the 
interrupt flag without affecting other interrupts. If you need to clear a pend- 
ing interrupt, it is recommended that you use a memory location to indicate 
that the interrupt is invalid. Then the interrupt service routine can read that 
location, clear it (if the pending interrupt is invalid), and return immediately. 
The following code fragments show how a dummy interrupt due to a long 
interrupt pulse might be handled: 



ISR n: 



PUSH ST 
PUSH DP 
PUSH RO 



; Save registers 



LDI 0, DP 



; Clear Data Page Pointer 



Program Flow Control 



6-33 



Interrupts 



LDI 
BNN 
STI 
POP 
POP 
POP 
RETI 



eDUMMYJENT, RO 

ISR_n_START 

DP, @DUMMY_INT 

RO 

DP 

ST 



If DUMMYJENT is 0 or positive, 
go to ISR_n_JSTART 
Set DUMMY INT - 0 



Housekeeping, return from interrupt 



ISR n START? 



ISR n END s 







; Normal interrupt service 






; Code goes here 


LDI 


INT_Fn, RO 




AND 


IF, RO 


; If ones in IF reg match 


BZ 


ISR_n_END 


; INT_Fn, exit ISR 


LDI 


0, DP 


; Otherwise clear 


LDI 


OFFFFh, RO 


; DP and set 


STI 


RO, 8DUMMYJCNT 


; DUMMY__INT negative & exit 


POP 


RO 




POP 


DP 


; Exit ISR 


POP 


ST 




RETI 




} 



6.6.9 Prioritization and Control 



The CPU controls all prioritization of interrupts (see Table 6-6 for reset and in- 
terrupt vector locations and priorities). If the DMA is not using interrupts for 
synchronization of transfers, it will not be affected by the processing of the 
CPU interrupts. Detected interrupts are responded to by the CPU and DMA 
on instruction fetch boundaries only. If instruction fetches are halted due to 
pipeline conflicts or when executing instructions in an RPTS loop, interrupts 
will not be responded to until instruction fetching continues. It is therefore pos- 
sible to interrupt the CPU and DMA simultaneously with the same or different 
interrupts and, in effect, synchronize their activities. For example, it may be 
necessary to cause a high-priority DMA transfer that avoids bus conflicts with 
the CPU, that is, make the DMA higher priority than the CPU. This may be ac- 
complished by using an interrupt that causes the CPU to trap to an interrupt 
routine that contains an IDLE instruction. Then if the same interrupt is used to 
synchronize DMA transfers, the DMA transfer counter can be used to generate 
an interrupt, thereby returning control to the CPU following the DMA transfer. 

Since the DMA and CPU share the same set of interrupt flags, the DMA can 
clear an interrupt flag before the CPU can respond to it. For example, if the 
CPU interrupts are disabled, the DMA can respond to interrupts and thus clear 
the associated interrupt flags. 
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Table 6-8. Reset and Interrupt Vector Locations 



Reset or 

Intairunt 

II I&9I 1 UUl 


Vector 

Location 

kVvBIIWI 1 


Priority 


Function 

■ Ml Ivllvl 1 


RESET 


Oh 


0 


External reset signal input on the RESET pin 


INTO 


1h 


1 


External interrupt input on the INTO pin 


INT1 


2h 


2 


External interrupt input on the INT1 pin 


INT2 


3h 


3 


External interrupt input on the INT2 pin 


INT3 


4h 


4 


External interrupt input on the INT3 pin 


XINTO 


5h 


5 


Internal interrupt generated when serial-port 0 transmit 
buffer is empty 


RINTO 


6h 


6 


Internal interrupt generated when serial-port 0 receive 
buffer is full 


XINT1 t 


7h 


7 


Internal interrupt generated when serial-port 1 transmit 
buffer is empty 


RINT1 t 


8h 


8 


Internal interrupt generated when serial-port 1 receive 
buffer is full 


TINTO 


9h 


9 


Internal interrupt generated by timer 0 


TINT1 


OAh 


10 


Internal interrupt generated by timer 1 


DINT 


OBh 


11 


Internal interrupt generated by DMA controller 0 



t Reserved on TMS320C31 



Program Flow Control 6-35 



TMS320LC31 Power Management Modes 



6.7 TMS320LC31 Power Management Modes 

The TMS320LC31 CPU has been enhanced by the addition of two power man- 
agement modes: 

□ IDLE2, and 

□ LOPOWER. 

6.7.1 IDLE2 

The H1 instruction clock is held high until one of the four external interrupts is 
asserted. In IDLE2 mode, the TMS320C31 behaves as follows: 

□ No instructions are executed. 

□ The CPU, peripherals, and internal memory retain their previous states. 

□ The primary bus output pins are idle: 

■ The address lines remain in their previous states, 

■ The data lines are in the high-impedance state, and 

■ The output control signals are inactive. 

□ When the device is in the functional (non-emulation) mode, the clocks stop 
with H1 high and H3 low (see Figure 6-6). 

□ The 'C31 will remain in IDLE2 until one of the four external interrupts 
(INT3-INT0) is asserted for at least one H1 cycle. When one of the four 
interrupts is asserted, the clocks start after a delay of one H1 cycle. When 
the clocks restart, they may be in the opposite phase (that is, H1 may be 
high if H3 was high before the clocks were stopped; H3 may be high if H1 
was previously high). The H1 and H3 clocks will remain 1 80° out of phase 
with each other (see Figure 6-7). 

□ For one of the four external interrupts to be recognized and serviced by 
the CPU during the IDLE2 operation, the interrupt must be asserted for 
less than three cycles but more than two cycles. 

□ The instruction following the IDLE2 instruction will not be executed until 
after the return from interrupt instruction (RETI) is executed. 

□ When the device is in emulation mode, the H1 and H3 clocks will continue 
to run normally and the CPU will operate as if an IDLE instruction had been 
executed. The clocks continue to run for correct operation of the emulator. 
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Delayed Branch 

For correct device operation, the three Instructions after a delayed 
branch should not be IDLE or IDLE2 instructions, 



Figure 6-6. IDLE2 Timing 
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Figure 6-7. Interrupt Response Timing After IDLE2 Operation 
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6.7.2 LOPOWER 



In the LOPOWER (low power) mode, the CPU continues to execute instruc- 
tions, and the DMA can continue to perform transfers, but at a reduced clock 
rate of CLKIN frequency 
16 

A TMS320C31 with a CLKIN frequency of 32 MHz will perform identically to 
a 2 MHz TMS320C31 with an instruction cycle time of 1 ,000 ns. 



During the read phase of the . . . 


The TMS320C31 . . . 


LOPOWER instruction (Figure 6-8) 


slows to 1/16 of full-speed operation. 


MAXSPEED instruction (Figure 6-9) 


resumes full-speed operation. 



Figure 6-8. LOPOWER Timing 
CLKIN 
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Figure 6-9. MAXSPEED Timing 



CLKIN 



H3 



H1 



f 



32 CLKIN 



MAXSPEED Read 

j — \ r 



\ 



■\ / — \ r 



6-38 



Chapter 7 



External Bus Operation 



Memories and external peripheral devices are accessible through two external 
interfaces on the TMS320C30: 

□ the primary bus, and 

□ the expansion bus. 

On the TMS320C31 , one bus, the primary bus, is available to access external 
memories and peripheral devices. You can control wait-state generation, per- 
mitting access to slower memories and peripherals, by manipulating 
memory-mapped control registers associated with the interfaces and by using 
an external input signal. 

Major topics discussed in this chapter are listed below. 



Topic Page 

7.1 External Interface Control Register , . ...... . . , . . 7-2 

7.2 External Interface Timing 7-6 

7.3 Programmable Wait States . . . ............ 7-28 

7.4 Programmable Bank Switching . 7-30 
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7.1 External Interface Control Registers 

The TMS320C30 provides two external interfaces: the primary bus and the ex- 
pansion bus. The TMS320C31 provides one external interface: the primary 
bus. The primary bus consists of a 32-bit data bus, a 24-bit address bus, and 
a set of control signals. The expansion bus consists of a 32-bit data bus, a 
13-bit address bus, and a set of control signals. Both buses support soft- 
ware-controlled wait states and an external ready input signal, and both buses 
are useful for data, program, and I/O accesses. 

Access is determined by an active strobe signal (STRB, MSTRB, orlOSTRB). 
When a primary bus access is performed, STRB is low. The expansion bus of 
the TMS320C30 supports two types of accesses: 

□ Memory access signalled by MSTRB low. The timing for an MSTRB ac- 
cess is the same as that of the STRB access on the primary bus. 

□ External peripheral device access is signaled by IOSTRB low. 

Each of the buses (primary and expansion) has an associated control register. 
These registers are memory-mapped as shown in Figure 7-1 . 

Figure 7-1. Memory-Mapped External Interface Control Registers 



Register 


Peripheral! 
Address 


Expansion-Bus Control (see subsection 7.1 .2)1" 


808060h 


Reserved 


808061 h 


Reserved 


808062h 


Reserved 


808063h 


Primary-Bus Control (see subsection 7.1.1) 


808064h 


Reserved 


808065h 


Reserved 


808066h 


Reserved 


808067h 


Reserved 


808068h 


Reserved 


808069h 


Reserved 


80806Ah 


Reserved 


80806Bh 


Reserved 


80806Ch 


Reserved 


80806Dh 


Reserved 


80806EM 


Reserved 


80806Fh 



t Reserved on the TMS320C31 
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7.1 .1 Primary-Bus Control Register 

The primary bus control register is a 32-bit register that contains the control 
bits for the primary bus (see Figure 7-2). Table 7-1 lists the register bits with 
the bit names and functions. 

Figure 7-2. Primary-Bus Control Register 



31 30 29 28 27 26 25 24 23 22 21 20 19 18 



17 



16 





H 


XX | 


XX | 


XX | XX | XX | 


XX | 


XX | XX | XX 


XX | XX 


| XX | XX 


i - 1 


15 


14 


13 


12 


11 10 9 


8 


7 6 5 


4 3 


2 1 


0 


i- 


H 


XX | 




BNKCMP 


i 


WTCNT 


SWW 


|hiz|nohold 


| HOLDST| 



R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W 



NOTE: xx = reserved bit, read as 0. 
R = read, W = write. 
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Table 7-1. Primary-Bus Control Register Bits Summary 



Bit Name Reset Value Function 



0 


HOLDST 


xt 


Hold status bit. This bit signals whether the port is being held 
(HOLDST = 1) or is not being held (HOLDST = 0). This status bit is valid 
whether the port has been held via hardware or software. 


1 


NOHOLD 


0 


Port hold signal. NOHOLD allows or disallows the port to be held bv an 
external HOLD signal. When NOHOLD = 1 , the TMS320C3x takes over 
the external bus and controls it, regardless of serviced or pending re- 
quests by external devices. No hold acknowledge (HOLDA) is asserted 
when a HOLD is received. However, it is asserted if an internal hold is 

npnpratpH l\-\\7 — 1 \ MOHOI n ic QPt in O at rpQpt 


2 


HIZ 


0 


Internal hold. When set (HIZ = 1), the port is put in hold mode. This is 
equivalent to the external HOLD signal. By forcing a high-impedance 
condition, the TMS320C3x can relinquish the external memory port 

thrnunh Qoftwprp HOI HA nrtPQ low whpn thp nort iq hIpppH in tho 
uiiv/viyii ov/i i wcii is. nv/Lun ywoo iuw whom 11 1X2 yjyjiv 10 |>/iciv/WJ ill 1110 

high-impedance state. HIZ is set to 0 at reset. 


4-3 


SWW 


11 


Software wait mode. In conjunction with WTCNT, this two-bit field de- 
fines the mode of wait-state generation. It is set to 1 1 at reset. 


7-5 


WTCNT 


111 


Software wait mode. This three-bit field specifies the number of cycles 
to use when in software wait mode for the generation of internal wait 
states. The range is 0 (WTCNT = 0 0 0) to 7 (WTCNT= 111) H1/H3 
cycles. It is set to 1 1 1 at reset. 


12-8 


BNKCMP 


10000 


Bank compare. This five-bit field specifies the number of MSBs of the 
address to be used to define the bank size. It is set to 1 0 0 0 0 at reset. 


31-13 


Reserved 


0-0 


Read as 0. 



tx = Oor 1 
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7.1 .2 Expansion-Bus Control Register 



The expansion-bus control register is a 32-bit register that contains control bits 
for the expansion bus (see Figure 7-3 and Table 7-2). 



Figure 7-3. Expansion-Bus Control Register 



91 ,99, 29, 29, 27, 29, 29, 24, 29, 22, 81, 89, 19, 19, 17. 19, 
xx|xx| xx| xx| xx| xx| xx| xx| xx| xxj xx| xx| xx| xx| xx| xx| 



1§ . H 13 18 11 1ft 9 . 9 . 7 9 ff 4 3 2 1 0 

xx I xx I xx I xx I xx I xx I xx I xx I WTCNT | SWW | xx | xx | xx 

R/WR/WR/W R/W R/W 



NOTE: xx = reserved bit, read as 0. 
R * read, W ■ write. 



Table 7-2. Expansion-Bus Control Register Bits Summary 



Bit 


Name 


Reset 
Value 


Function 


2-0 


Reserved 


000 


Read as 0. 


4-3 


SWW 


11 


Software wait-state generation. In conjunction with the WTCNT, this 
two-bit field defines the mode of wait-state generation. It is set to 1 1 
at reset. 
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WTCNT 


111 


Software wait mode. This three-bit field specifies the number of cycles 
to use when in software wait mode for the generation of internal wait 
states. The range is 0 (WTCNT = 0 0 0) to 7 (WTCNT =111) H1/H3 
clock cycles. It is set to 1 1 1 at reset. 


31-8 


Reserved 


0-0 


Read as 0. 
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7.2 External Interface Timing 

This section discusses functional timing of operations on the primary bus and 
the expansion bus, the TMS320C3x's two independent parallel buses. 
Detailed timing specifications for all TMS320C3x signals are contained in Sec- 
tion 13.6 on page 13-31. 

The parallel buses implement three mutually exclusive a ddress spaces d istin- 
guished through the u se of three separate control signals: STRB, MSTRB, and 
IOSTRB . Th e STRB signal controls accesses on the primary bus, and the 
MSTRB and IOSTRB control accesses on the expansion bus. Since the two 
buses are independent, you can make two accesses in parallel. 

With the exception of bank switching and the external HOLD function (dis- 
cussed later in this section), timing of primary bus cycles and MSTRB expan- 
sion bus cycles are identical and are discussed collectively. The acronym 
(M)STRB is used in references that pert ain equally to STRB and MSTRB. Sim- 
ilarly, (X)R/W, (X)A, (X)D, and (X)RDY ar e used to symbolize the equivalent 
primary and expansion bus signals. The IOSTRB expansion bus cycles are 
timed differently and are discussed independently. 

7.2.1 Primary-Bus Cycles 

All bus cycles comprise integral numbers of H1 clock cycles. One H1 cycle is 
defined to be from one falling edge of H1 to the next falling edge of H1 . For 
full-speed (zero wait-state) accesses, writes require two H1 cycles and reads 
one cycle; however, if the read follows a write, the read requires two 
cycles.This applies to both the primary bus and the MSTRB expansion bus ac- 
cess. Recall that, internally (from the perspective of the CPU and DMA), writes 
require only one cycle if no accesses to that interface are in progress. The fol- 
lowing discussions pertain to zero wait-state accesses unless otherwise spe- 
cified. 



The (M)STRB signal is low for the active portion of both reads and writes. The 
active portion lasts one H1 cycle. Additionally, before and after the active por- 
tion ((M)STRB low) of writes only, there is a transition cycle of H 1 . This transi- 
tion cycle consists of the following sequence: 

1) (M)STRB is high. 

2) If required, (X)R/W changes state on H1 rising. 

3) If required, address changes on H1 rising if the previous H1 cycle was the 
active portion of a write. If the previous H1 cycle was a read, address 
changes on H1 falling. 
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Figure 7-4 illustrates a read-read-write sequence for (M)STRB active and no 
wait states. The data is read as late in the cycle as possible to allow maximum 
access time from address valid. Note that although external writes require two 
cycles, internally (from the perspective of the CPU and DMA) they require only 
one cycle if no accesses to that interface are in progress. In the typical timing 
fo r all exter nal interfaces, the (X)R/W strobe does not change until (M)STRB 
or IOSTRB goes inactive. 



Figure 7-4. Read-Read-Write for (M)STRB = 0 




Note: Back-to-Back Read Operations 



(M)STRB will remain low during back-to-back read operations, 
i 
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Figure 7-5 illustrates a write-write-read sequence for (M)STRB active and no 
wait states. The address and data written are held valid approximately 
one-half cycle after (M)STRB changes. 



Figure 7-5. Write-Write-Read for (M)STRB = 0 
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Figure 7-6 illustrates a read cycle with one wait state. Since (X)RDY = 1 , the 
read cycle is extended. (M)STRB, (X)R/W, and (X)A are also extended one 
cycle. The next time (X)RDY is sampled, it is 0. 



Figure 7-6. Use of Wait States for Read for (M)STRB = 0 
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Figure 7-7 illustrates a write cy cle with on e waitstate. Since initially (X)RDY = 
1 , the write cycle is extended. (M)STRB, (X)R/W, and (X)A are extended one 
cycle. The next time (X)RDY is sampled, it is 0. 



Figure 7-7. Use of Wait States for Write for (M)STRB = 0 
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7.2.2 Expansion-Bus I/O Cycles 



In contrast to primary bus and MSTRB cycles, IOSTRB reads and writes are 
both two cycles in duration (with no wait states) and exhibit the same timing. 
During these cycles, address always changes on the falling edge of H1 , and 
IOSTRB is low from the rising edge of the first H1 cycle to the rising edge of 
the second H1 cycle. The IOSTRB signal always goes inactive (high) between 
cycles, and XR/W is high for reads and low for writes. 



Figure 7-8 illustrates read and write cycles when IOSTRB is active and there 
are no wait states. For IOSTRB accesses, reads and writes require a minimum 
of two cycles. Some off-chip peripherals might change their status bits when 
read or written to. Therefore, it is important to maintain valid addresses when 
communicating with these peripherals. For reads and writes when IOSTRB is 
active, IOSTRB is completely framed by the address. 



Figure 7-8. Read and Write for IOSTRB = 0 
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Figure 7-9 illustrates a read with one wait state when I OSTRB is active, and 
Figure 7-1 0 illustrates a write with one wait state when IOSTRB is active. For 
each wait state added, IOSTRB, XR/W, and XA are extended one cloc k cycle. 
Writes hold the data on the bus one additional cycle. The sampling of XRDY 
is repeated each cycle. 



Figure 7-9. Read With One Wait State for IOSTRB = 0 
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Figure 7-11, Figure 7-12, Figure 7-13, Figure 7-14, Figure 7-15, 
Figure 7-1 6, Figure 7-17, Figure 7-18, Figure 7-19, Figure 7-20, and 
Figure 7-21 illustrate the various transitions between memory reads and 
writes, and I/O writes over the expansion bus. 



Figure 7-11. Memory Read and I/O Write for Expansion Bus 
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Figure 7-12. Memory Read and I/O Read for Expansion Bus 
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Figure 7-18. I/O Read and Memory Read for Expansion Bus 
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Figure 7-19. I/O Write and I/O Read for Expansion Bus 
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Figure 7-20. I/O Write and I/O Write for Expansion Bus 
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Figure 7-21. I/O Read and I/O Read for Expansion Bus 
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Figure 7- 22 and F igur e 7-23 illu strate the signal states when a bus is i nactive 
(after an IO STRB or (M)STRB_access, respectively). The strobes (STRB, 
MSTRB and I OSTRB ) and(X )R/W) goto 1 . The address is undefined, and the 
ready signal (XRDY or RDY) is ignored. 



Figure 7-22. Inactive Bus States for IOSTRB 
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Figure 7-24 illustrates the timing for HOLD and HOLDA. HOLD is an external 
asynchronous input. There is a minimum of one cycle delay fro m the tim e when 
the processor recognizes HOLD = 0 until HOLDA = 0. When HOLDA = 0, the 
address, data buses, and associated strobes are placed in a high-impedance 
state. All accesses occurring over an interface are complete before a hold is 
acknowledged. 



Figure 7-24. HOLD and HOLDA Timing 
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7.3 Programmable Wait States 

You can control wait-state generation by manipulating memory-mapped con- 
trol registers associated with both the primary and expansion interfaces. Use 
the WTCNT field to load an internal timer, and use the SWW field to select one 
of the following four modes of wait-state generation: 

□ External RDY 

□ WTCNT-gener ated R DY wtcnt 

□ Logical- AND of RDY an d RDY wtcnt 

□ Logical-OR of RDY and RDYwtcnt 

The four modes are used to ge nerate the internal ready signal, RDYj nt , that 
controls accesse s. As long as RDYj nt = 1 , the current external access is 
delayed. When RDYj nt = 0, the current access completes. Since the use of 
programmable wait states for both external interfaces is identical, only the pri- 
mary bus interface is described in the following paragraphs. 

RDYwtcnt is an internally generated ready signal. When an external access is 
begun, the value in WTCNT is loaded into a counter. WTCNT can be any value 
from 0 through 7. The counter is decremented every H1/H3 clock cycle until 
it becomes 0. Once the counter is set to 0, i t remains set to 0 until the next ac- 
cess. While the counter is nonzero, RDYwtcnt = 1 . While the counter is 0, 
RDY wtcnt = 0- 
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When SWW = 0 0, RDYj nt depends only on RDY. RDY^^ is ignored. 
Table 7-3 is the truth table for this mode. 

Table 7-3. Wait-State Generation When SWW= 0 0 



RDY 


RDY wtcnt 


RDY lnt 


0 


0 


0 


0 


1 


0 


1 


0 


1 


1 


1 


1 



When SWW = 01, RDYj nt depends only on RDY^^. RDY is ignored. 
Table 7-4 is the truth table for this mode. 

Table 7-4. Wait-State Generation When SWW = 0 1 



RDY 


RDY wtcnt 


RDY lnt 


0 


0 


0 


0 


1 


1 


1 


0 


0 


1 


1 


1 



When SWW = 1 0, RDYjnt is t he log ical-OR (electrical-AND, since these sig- 
nals are low true) of RDY and RDYjnt (see Table 7-5). 

Table 7-5. Wait-State Generation When SWW= 1 0 



RDY 


RDY wtcnt 


RDY lnt 


0 


0 


0 


0 


1 


0 


1 


0 


0 


1 


1 


1 



When SWW = 11, RDYj nt is the logical-AND (electrical-OR, since these sig- 
nals are low true) of RDY and RDYwtcnt. The truth table for this mode is 
Table 7-6. 

Table 7-6. Wait-State Generation When SWW= 1 1 



RDY 


RDY wtcnt 


RDY lnt 


0 


0 


0 


0 


1 


1 


1 


0 


1 


1 


1 


1 
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7.4 Programmable Bank Switching 

Programmable bank switching allows you to switch between external memory 
banks without externally inserting wait states due to memories that require 
several cycles to turn off. Bank switching is implemented on the primary bus 
and not on the expansion bus. 

The size of a bank is determined by the number of bits specified to be ex- 
amined on the BNKCMP field of the primary bus control register (see 
Table 7-1 on page 7-4). For example (see Figure 7-25), if BNKCMP = 16, 
the 16 MSBs of the address are used to define a bank. Since addresses are 
24 bits, the bank size is specified by the eight LSBs, yielding a bank size of 256 
words. If BNKCMP * 1 6, only the 1 6 MSBs are compared. Bank sizes from 2 8 
= 256 to 2 24 = 16M are allowed. Table 7-7 summarizes the relationship be- 
tween BNKCMP, the address bits used to define a bank, and the resulting bank 
size. 

Figure 7-25. BNKCMP Example 
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Table 7-7. BNKCMP and Bank Size 



BNKCMP 


MSBs Defining a Bank 


Bank Size (32-Bit 


00000 


None 


2 24 =16M 


00001 


23 


223= 8M 


00010 


23—22 


222= 4M 


00011 


23—21 


221 =2M 


00100 


23—20 


220= 1M 


00101 


23—19 


219=512K 


00110 


23—18 


2 18 = 256K 


00111 


23—17 


217=128K 


01000 


23—16 


216= 64K 


01001 


23—15 


2 15 = 32K 


01010 


23—14 


2 14 =16K 


01011 


23—13 


213= 8K 


01100 


23—22 


212= 4K 


01101 


23—11 


2H=2K 


01110 


23—12 


21<>=1K 


01111 


23—9 


29 =512 


10000 


23—8 


2 8 =256 


10000—11111 


Reserved 


Undefined 



Words) 



23 
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The TMS320C3x has an internal register that contains the MSBs (as defined 
by the BNKCMP field) of the last address used for a read or write over the pri- 
mary interface. At reset, the register bits are set to 0. If the MSBs of the address 
being used for the current primary interface read do not match those contained 
in this internal register, a read cycle is not asserted for one H1/H3 clock cycle. 
During thi s extra clock cycle, the address bus switches over to the new ad- 
dress, but STRB is inactive (high). The contents of the internal register are re- 
placed with the MSBs being used for the current read of the current address. 
If the MSBs of the address being used for the current read match the bits in 
the register, a normal read cycle takes place. 

If repeated reads are performed from the same memory bank, no extra cycles 
are inserted. When a read is performed from a different memory bank, memory 
conflicts are avoided by the insertion of an extra cycle. This feature can be dis- 
abled by setting BNKCMP to 0. The insertion of the extra cycle occurs only 
when a read is performed. The changing of the MSBs in the internal register 
occurs for all reads and writes over the primary interface. 

Figure 7-26 illustrates the addition of an inactive cycle when switches be- 
tween banks of memory occur. 



Figure 7-26. Bank-Switching Example 
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The TMS320C3x features two timers, two serial ports (one on the 
TMS320C31), and an on-chip direct memory access (DMA) controller. These 
peripheral modules are controlled through memory-mapped registers located 
on the dedicated peripheral bus. 

The DMA controller is used to perform input/output operations without interfer- 
ing with the operation of the CPU. Therefore, it is possible to interface the 
TMS320C3x to slow external memories and peripherals (A/Ds, serial ports, 
etc.) without reducing the computational throughput of the CPU. The result is 
improved system performance and decreased system cost. 

Major topics discussed in this chapter on peripherals are listed below. 



Topic Page 
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8.1 Timers 

The TMS320C3x timer modules are general-purpose, 32-bit, timer/event 
counters, with two signaling modes and internal or external clocking (see 
Figure 8-1 ) . You can use the timer modules to signal to the TMS320C3X or the 
external world at specified intervals or to count external events. With an inter- 
nal clock, you can use the timer to signal an external A/D converter to start a 
conversion, or it can interrupt the TMS320C3x DMA controller to begin a data 
transfer. The timer interrupt is one of the internal interrupts. With an external 
clock, the timer can count external events and interrupt the CPU after a speci- 
fied number of events. Each timer has an I/O pin that you can use as an input 
clock to the timer, an output clock signal, or a general-purpose I/O pin. 



Figure 8-1. Timer Block Diagram 
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Three memory-mapped registers are used by each timer: 

□ Global-Control Register 

The global-control register determines the operating mode of the timer, 
monitors the timer status, and controls the function of the I/O pin of the timer. 

□ Period Register 

The period register specifies the timer's signaling frequency. 
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□ Counter Register 

The counter register contains the current value of the incrementing count- 
er. You can increment the timer on the rising edge or the falling edge of the 
input clock. The counter is zeroed and can cause an internal interrupt 
whenever its value equals that in the period register. The pulse generator 
generates two types of external clock signals: pulse or clock. The memory 
map for the timer modules is shown in Figure 8-2. 



Figure 8-2. Memory-Mapped Timer Locations 
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Reserved 


808023h 


808033h 


Timer Counter (See subsection 8.1 .2) 


808024h 


808034h 


Reserved 


808025h 


808035h 


Reserved 


808026h 


808036h 


Reserved 


808027h 


808037h 


Timer Period (See subsection 8.1 .2) 


808028h 


808038h 


Reserved 


808029h 


808039h 


Reserved 


80802Ah 


80803Ah 


Reserved 


80802Bh 


80803Bh 


Reserved 


80802Ch 


80803Ch 


Reserved 


80802Dh 


80803Dh 


Reserved 


80802Eh 


80803Eh 


Reserved 


80802Fh 


80803Fh 



8.1 .1 Timer Global-Control Register 

The timer global control register is a 32-bit register that contains the global and 
port control bits for the timer module. Table 8-1 defines this register's bits, 
names, and functions. Bits 3-0 are the port control bits; bits 11-6 are the tim- 
er global control bits. Figure 8-3 shows the 32-bit register. Note that at reset, 
all bits are set to 0 except for DATIN (which is set to the value read on TCLK). 
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Figure 8-3. Timer Global-Control Register 
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R = Read, W = Write, xx= reserved bit, read as 0 



Table 8-1. Timer Global-Control Register Bits Summary 



Bits Name Reset Value Function 



0 


FUNC 


0 


FUNC controls the function of TCLK. If FUNC = 0, TCLK is confi- 
gured as a general-purpose digital I/O port. If FUNC = 1 , TCLK is 
configured as a timer pin (see Figure 8-4 for a description of the 
relationship between FUNC and CLKSRC). 


1 


I/O 


0 


If FUNC = 0 and CLKSRC = 0, JCLK is configured as a general- 
purpose I/O pin. In this case, jf T/O = 0, TCLK is configured as a 
general-purpose input pin. If T/O = 1 , TCLK is configured as a gen- 
eral-purpose output pin. 


2 


DATOUT 


0 


DATOUT drives TCLK when the TMS320C3x is in I/O port mode. 
You can use DATOUT as an input to the timer. 


3 


DATIN 


xt 


Data input on TCLK or DATOUT. A write has no effect. 


5-4 


Reserved 


0-0 


Read as 0. 


6 


GO 


0 


The GO bit resets and starts the timer counter. When GO = 1 and 
the timer is not held, the counter is zeroed and begins increment- 
ing on the next rising edge of the timer input clock. The GO bit is 
cleared on the same rising edge. GO = 0 has no effect on the 
timer. 


7 


HLD 


0 


Counter hold signal. When this bit is 0, the counter is disabled and 
held in its current state. If the timer is driving TCLK, the state of 
TCLK is also held. The internal divide-by-two counter is also held 
so that the counter can continue where it left off when HLD is set to 
1 . You can read and modify the timer registers while the timer is 
being held. RESET has priority over HLD. Table &-2 shows the 
effect of writing to GO and HLD. 


8 


C/P 


0 


Clock/Pulse mode control. When C/P = 1 , clock mode is chosen, 
and the signaling of the TSTAT flag and external output will have a 
50 percent duty cycle. When C/P = 0, the status flag and external 
output will be active for one H1 cycle during each timer period (see 
Figure 8-5 on page 8-7). 



t x = 0or 1 
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Table 8-1. Timer Global-Control Register Bits Summary (Continued) 



Bits Name Reset Value Function 



9 


CLKSRC 


0 


Specifies the source of the timer clock. When CLKSRC = 1 , an inter- 
nal clock with frequency equal to one-half of the H1 frequency is 
used to increment the counter. The INV bit has no effect on the inter- 
nal clock source. When CLKSRC = 0, you can use an external signal 
from the TCLK pin to increment the counter. The external clock is 
synchronized internally, thus allowing external asynchronous clock 
sources that do not exceed the specified maximum allowable exter- 
nal clock frequency. This will be less than f(H1)/2. (See Figure 8-4 
for a description of the relationship between FUNC and CLKSRC). 


10 


INV 


0 


Inverter control bit. If an external clock source is used and INV = 1 , the 
external clock is inverted as it goes into the counter. If the output of the 
Dulse nenerator is routed to TCLK and INV = 1 the outDut is inverted 
before it goes to TCLK (see Figure 8—1). If INV = 0, no inversion is 
performed on the input or output of the timer. The INV bit has no effect, 
regardless of its value, when TCLK is used in I/O port mode. 


11 


TSTAT 


0 


This bit indicates the status of the timer. It tracks the output of the 
uninverted TCLK pin. This flag sets a CPU interrupt on a transition from 
0 to 1 . A write has no effect. 


31-12 


Reserved 


0-0 


Read as 0. 



tx = 0or1 
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Figure 8-4. Timer Modes as Defined by CLKSRC and FUNC 
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Figure 8-5. Timer Timing 
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(b) TSTAT and timer output (INV = 0) when C/P = 1 (clock mode) 

The rate of timer signaling is determined by the frequency of the timer input 
clock and the period register. The following equations are valid with either an 
internal or an external timer clock: 

f (pulse mode) = f (timer clock) / period register 

f (clock mode) = f (timer clock) / (2 x period register) 

i 1 

Note: Period Register 

If the period register equals 0, refer to Section 8.1 .2. 



Table 8-2 shows the result of a write using specified values of the GO and HLD 
bits in the global control register. 
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Table 8-2. Result of a Write of Specified Values of GO and HLD 



GO 


HLD 


Result 


0 


0 


All timer operations are held. No reset is performed. (Reset value) 


0 


1 


Timer proceeds from state before write. 


1 


0 


All timer operations are held, including zeroing of the counter. The 
GO bit is not cleared until the timer is taken out of hold. 


1 


1 


Timer resets and starts. 



8.1 .2 Timer Period and Counter Registers 

The 32-bit timer period register is used to specify the frequency of the timer 
signaling. The timer counter register is a 32-bit register, which is reset to 0 
whenever it increments to the value of the period register. Both registers are 
set to 0 at reset. 

Certain boundary conditions affect timer operation. These conditions are listed 
below: 

□ When the period and counter registers are 0, the operation of the timer is 
dependent upon the C/P mode selected. In pulse mode (C/P = 0), TSTAT 
is set and remains set. In clock mode (C/P = 1), the width of the cycle is 
2/f(H1), and the external clocks are ignored. 

□ When the counter register is not 0 and the period register = 0, the counter 
will count, roll over to 0, and then behave as described above. 

□ When the counter register is set to a value greater than the period register, 
the counter may overflow when being incremented. Once the counter 
reaches its maximum 32-bit value (OFFFFFFFFh), it simply clocks over to 
0 and continues. 

Writes from the peripheral bus override register updates from the counter and 
new status updates to the control register. 

8.1 .3 Timer Pulse Generation 

The timer pulse generator (see Figure 8-1 on page 8-2) can generate sever- 
al external signals. You can invert these signals with the INV bit. The two basic 
modes are pulse mode and clock mode, as shown in Figure 8-5 on page 8-7. 
In both modes, an internal clock source f (timer clock) has a frequency of 
f(H1)/2, and an externally generated clock source f (timer clock) can have a 
maximum frequency of f(H1)/2.6. Refer to timer timing in subsection 13.5.16 
on page 13-66. In pulse mode (C/P = 0), the width of the pulse is 1/f(H1). 
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Figure 8-6 provides some examples of the TCLKx output when the period reg- 
ister is set to various values and clock or pulse mode is selected. 



Figure 8-6. Timer Output Generation Examples 



H1 



■2H1 



nJTJ-LT 



(a) INV = 0, C/P = 0 (Pulse Mode) 
Timer Period = 1 
Also, 

INV = 0, C/P = 1 (Clock Mode) 
Timer Period = 0 



H1 -*| 




(b) INV = 0, C/P = 0 (Pulse Mode) 
Timer Period = 2 



H1 -*| 



JH-6H1 -*| 



n tl 



(c) INV = 0, C/P = 0 (Pulse Mode) 
Timer Period « 3 



|*-4H1 -* 
-» ] 2H1p 



(d) INV = 0, C/P = 1 (Clock Mode) 
Timer Period = 1 



■8H1 



4H1 



12H1 



6H1 



(e) INV = 0, C/P = 1 (Clock Mode) 
Timer Period = 2 



(f) INV = 0, C/P = 1 (Clock Mode) 
Timer Period = 3 
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8.1 .4 Timer Operation Modes 



The timer can receive its input and send its output in several different modes, 
depending upon the setting of CLKSRC, FUNC, and I/O. The four timer modes 
of operation are defined as follows: 

□ If CLKSRC = 1 and FUNC = 0, the timer input comes from the internal 
clock. The internal clock is not affected by the INV bit. In this mode, TCLK 
is connected to the I/O port control, and you use TCLK as a general-pur- 
pose I/O pin (see Figure 8-7). If I/O = 0, TCLK is configured as a general- 
purpose input pin whose state you can read in DATIN. DATOUT has no 
effect on TCLK or DATIN. If 1/0 = 1, TCLK is configured as a 
general-purpose output pin. DATOUT is placed on TCLK and can be read 
in DATIN. 



Figure 8-7. Timer I/O Port Configurations 



Internal 



DATOUT (NC) 



External 



TCLK 



DATOUT - 



DATIN 
!/O = 0 
(a) 

Internal 
1 



External 
TCLK 



□ 



□ 



DATIN 
1/0 = 1 
(b) 

If CLKSRC = 1 and FUNC = 1, the timer input comes from the internal 
clock, and the timer output goes to TCLK. This value can be inverted using 
INV, and you can read in DATIN the value output on TCLK. 

If CLKSRC = 0 and FUNC = 0, the timer is driven according to the status 
of the I/O bit. If I/O = 0, the timer input comes from TCLK. This value can 
be inverted using INV, and you can read in DATIN the value of TCLK. If I/O 
= 1 , TCLK is an output pin. Then, TCLK and the timer are both driven by 
DATOUT. All 0-to-1 transitions of DATOUT increment the counter. INV has 
no effect on DATOUT. You can read in DATIN the value of DATOUT. 

If CLKSRC = 0 and FUNC = 1 , TCLK drives the timer. If INV = 0, all 0-to-1 
transitions of TCLK increment the counter. If INV = 1, all 1 -to-0 transitions 
of TCLK increment the counter. You can read in DATIN the value of TCLK. 
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Figure 8-4 on page 8-6 shows the four timer modes of operation. 



8.1 .5 Timer Interrupts 



A timer interrupt is generated whenever the TSTAT bit of the timer control reg- 
ister changes from a 0 to a 1 . The frequency of timer interrupts depends on 
whether the timer is set up in pulse mode or clock mode. 

□ In pulse mode, the interrupt frequency is determined by the following 
equation: 

f (interrupt) = f (timer clock) herQ 
(interrupt; penod register ' 

f (interrupt) = timer frequency 
f (timer clock) = interrupt frequency 

□ In clock mode, the interrupt frequency is determined by the following equa- 
tion: 

f (jnt t) = f (timer clock) here 
(interrupt; 2 x period register 

f (interrupt) = timer frequency 
f (timer clock) = interrupt frequency 

The timer counter is automatically reset to 0 whenever it is equal to the value 
in the timer period register. You can use the timer interrupt for either the CPU 
or the DMA. Interrupt enable control for each timer, for either the CPU or the 
DMA, is found in the CPU/DMA interrupt enable register. Refer to subsection 
3.1.8 on page 3-7 for more information on the CPU/DMA interrupt enable 
register. 

When a timer interrupt occurs, a change in the state of the corresponding 
TCLK pin will be observed if FUNC = 1 and CLKSRC = 1 in the timer global- 
control register. The exact change in the state depends on the state of the 
C/P bit. 
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8.1 .6 Timer Initialization/Reconfiguration 

The timers are controlled through memory-mapped registers located on the 
dedicated peripheral bus. Following is the general procedure for initializing 
and/or reconfiguring the timers: 

1 ) Halt the timer by clearing the GO/HLD bits of the timer global-control regis- 
ter. To do this, write a 0 to the timer global-control register. Note that the 
timers are halted on RESET. 

2) Configure the timer via the timer global-control register (with GO = HLD 
= 0 ), the timer counter register, and timer period register, if necessary. 

3) Start the timer by setting the GO/HLD bits of the timer global-control 
register. 
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8.2 Serial Ports 

The TMS320C30 has two totally independent bidirectional serial ports. Both 
serial ports are identical, and there is a complementary set of control registers 
in each one. Only one serial port is available on the TMS320C31 . You can con- 
figure each serial port to transfer 8, 1 6, 24, or 32 bits of data per word simulta- 
neously in both directions. The clock for each serial port can originate either 
internally, via the serial port timer and period registers, or externally, via a 
supplied clock. An internally generated clock is a divide-down of the clockout 
frequency, f (H 1 ) . A continuous transfer mode is available, which allows the se- 
rial port to transmit and receive any number of words without new synchroniza- 
tion pulses. 

Eight memory-mapped registers are provided for each serial port: 

□ Global-control register 

□ Two control registers for the six serial I/O pins 

□ Three receive/transmit timer registers 

□ Data-transmit register 

□ Data-receive register 

The global-control register controls the global functions of the serial port and 
determines the serial-port operating mode. Two port control registers control 
the functions of the six serial port pins. The transmit buffer contains the next 
complete word to be transmitted. The receive buffer contains the last complete 
word received. Three additional registers are associated with the transmit/re- 
ceive sections of the serial-port timer. A serial-port block diagram is shown in 
Figure 8-8 on page 8-1 4, and the memory map of the serial ports is shown in 
Figure 8-9 on page 8-15. 
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Figure 8-8. Serial-Port Block Diagram 
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Figure 8-9. Memory-Mapped Locations for the Serial Ports 

Register Peripheral Address 





dsnai 
PortO 


oenai 
Portlt 


Serial-Port Global Control (See Figure 8-10) 


808040h 


808050h 


Reserved 


808041 h 


808051 h 


FSX/DX/CLKX Port Control (See Figure 8-11) 


808042h 


808052h 


FSR/DR/CLKR Port Control (See Figure 8-12) 


808043h 


808053h 


R/X Timer Control (See Figure 8-13) 


808044h 


808054h 


R/X Timer Counter (See Figure 8-14) 


808045h 


808055h 


R/X Timer Period (See Figure 8-15) 


808046h 


808056h 


Reserved 


808047h 


808057h 


Data Transmit (See Figure 8-16) 


808048h 


808058h 


Reserved 


808049h 


808059h 


Reserved 


80804Ah 


80805Ah 


Reserved 


80804Bh 


80805Bh 


Data Receive (See Figure 8-17) 


80804Ch 


80805Ch 


Reserved 


80804Dh 


80805Dh 


Reserved 


80804Eh 


80805Eh 


Reserved 


80804Fh 


80805Fh 



t Reserved locations on the TMS320C31 

8.2.1 Serial-Port Global-Control Register 

The serial-port global-control register is a 32-bit register that contains the glob- 
al control bits for the serial port. Table 8-3 defines the register bits, bit names, 
and bit functions. The register is shown in Figure 8-10. 

Table 8-3. Serial-Port Global-Control Register Bits Summary 



Bit Name Reset Value Function 



0 


RRDY 


0 


If RRDY = 1 , the receive buffer has new data and is ready to be read. A 
three H1/H3 cycle delay occurs from the loading of DRR to RRDY = 1 . The 
rising edge of this signal sets RINT. If RRDY= 0 at reset, the receive buffer 
does not have new data since the last read. RRDY = 0 at reset and after 
the receive buffer is read. 


1 


XRDY 


1 


If XRDY = 1 , the transmit buffer has written the last bit of data to the shifter 
and is ready for a new word. A three H1/H3 cycle delay occurs from the 
loading of the transmit shifter until XRDY is set to 1 . The rising edge of this 
signal sets XINT. If XRDY=0, the transmit buffer has not written the last 
bit of data to the transmit shifter and is not ready for a new word. XRDY = 
1 at reset. 


2 


FSXOUT 


0 


This bit configures the FSX pin as an input (FSXOUT = 0) or an output 
(FSXOUT=1). 
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Table 8-3. Serial-Port Global-Control Register Bits Summary (Continued) 



Bit 


Name 


Reset Value 


Function 


3 


XSREMPTY 


0 


If XSREMPTY = 0, the transmit shift register is empty. If XSREMPTY = 1 , 
the transmit shift register is not empty. Reset or XRESET causes this bit 
to = 0. 


4 


RSRFULL 


0 


If RSRFULL = 1, an overrun of the receiver has occurred. In continuous 
mode, RSRFULL is set to 1 when both RSR and DRR are full. I n noncontin- 
uous mode, RSRFULL is set to 1 when RSR and DRR are full and a new 
FSR is received. A read causes this bit to be set to 0. This bit can be set 
to 0 only by a system reset, a serial-port receive reset (RRESET = 1), or 
a read. When the receiver tries to set RSRFULL to 1 at the same time that 
the global register is read, the receiver will dominate, and RSRFULL is set 
to 1 . If RSRFULL = 0, no overrun of the receiver has occurred. 


5 


HS 


0 


If HS = 1 , the handshake mode is enabled. If HS = 0, the handshake mode 
is disabled. 


6 


XCLKSRCE 


0 


If XCLKSRCE = 1 , the internal transmit clock is used. If XCLKSRCE = 0, 
the external transmit clock is used. 


7 


RCLKSRCE 


0 


If RCLKSRCE = 1 , the internal receive clock is used. If RCLKSRCE = 0, 
the external receive clock is used. 


8 


XVAREN 


0 


This bit specifies fixed (XVAREN = 0) or variable (XVAREN = 1) data rate 
signaling when transmitting. With afixed data rate, FSX is active for at least 
one XCLK cycle and then goes inactive before transmission begins. With 
variable data rate, FSX is active while all bits are being transmitted. When 
you use an external FSX and variable data rate signaling, the DX pin is driv- 
en by the transmitter when FSX is held active or when a word is being 
shifted out. 


9 


RVAREN 


0 


This bit specifies fixed (RVAREN = 0) or variable (RVAREN = 1) data rate 
signaling when receiving. With a fixed data rate, FSR is active for at least 
one RCLK cycle and then goes inactive before the reception begins. With 
variable data rate, FSR is active while all bits are being received. 


10 


XFSM 


0 


Transmit frame sync mode. Configures the port for continuous mode oper- 
ation(XFSM = 1) or standard mode (XFSM = 0). In continuous mode, only 
the first word of a block generates a sync pulse, and the rest are simply 
transmitted continuously to the end of the block. In standard mode, each 
word has an associated sync pulse. 


11 


RFSM 


0 


Receive frame sync mode. Configures the port for continuous mode 
(RFSM =1) or standard mode (RFSM = 0) operation. In continuous mode, 
only the first word of a block generates a sync pulse, and the rest are simply 
received continuously without expectation of another sync pulse. In stan- 
dard mode, each word received has an associated sync pulse. 


12 


CLKXP 


0 


CLKX polarity. If CLKXP = 0, CLKX is active high. If CLKXP = 1 , CLKX is 
active low. 
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Table 8-3. Serial-Port Global-Control Register Bits Summary (Continued) 



Bit 


Name 


Reset Value 


Function 


13 


CLKRP 


0 


CLKR polarity. If CLKRP = 0, CLKR is active (high). If CLKRP =1 , CLKR 
is active (low). 


14 


DXP 


0 


DX polarity. If DXP = 0, DX is active (high). If DXP = 1 , DX is active (low). 


15 


DRP 


0 


DR polarity. If DRP = 0, DR is active (high). If DRP = 1 , DR is active (low). 


16 


FSXP 


0 


FSX polarity. If FSXP = 0, FSX is active (high). If FSXP = 1, FSX is 
active (low). 


17 


FSRP 


0 


FSR polarity. If FSRP = 0, FSR is active (high). If FSRP = 1, FSR is 
active (low). 


1&-18 


XLEN 


00 


These two bits define the word length of serial data transmitted. All data 

ic acci imoH tA ha ririht.ii ictifiaH in tha trancmit hi iff or tuhon fa\Asar than *XO 

IO CIOOUIIIOU IV/ UO 1 lyl ll JUOIIIIOU III III© UCIIIOIIIII UUIIOI WHOM lOWOI IIICIII Ofa 

bits are specified. 

0 0—8 bits 1 0 — 24 bits 
0 1 — 16 bits 11 — 32 bits 


21-20 


RLEN 


00 


These two bits define the word length of serial data received. All data is 
right-justified in the receive buffer. 

0 0 — 8 bits 10 — 24 bits 

n 1 16 bits 1 1 3? bits 


22 


XTINT 


0 


Transmit timer interrupt enable. If XTINT = 0, the transmit timer interrupt 

ic HioahlaH If VTIMT — 1 tha trancmit timar intairimt ic anahlaH 
lo uioauicfu. ii y\ I iin i — i, uic; iicmi&iiiii uiiioi iiHofiupi lo uncftuiuu. 


23 


XINT 


0 


Transmit interrupt enable. If XINT = 0, the transmit interrupt is disabled. If 
XINT= 1 , the transmit interrupt is enabled. Note that the CPU receive flag 
XINT and the serial port-to-DMA interrupt (EXINT0 in the IE register) is the 
OR of the enabled transmit timer interrupt and the enabled transmit inter- 
rupt. 


24 


RTINT 


0 


Receive timer interrupt enable. If RTINT = 0, the receive timer interrupt is 
disabled. If RTINT = 1 , the receive timer interrupt is enabled. 


25 


RINT 


0 


Receive interrupt enable. If RINT = 0, the receive interrupt is disabled. If 
RINT= 1 , the receive interrupt is enabled. Note that the CPU receive flag 
RINT and the serial-port-to-DMA interrupt (ERINT0 in the IE register) is the 
OR of the enabled receive timer interrupt and the enabled receive inter- 
rupt. 


26 


XRESET 


0 


Transmit reset. If XRESET = 0, the transmit side of the serial port is reset. 
To take the transmit side of the serial port out of reset, set XRESET to 1 . 
However, do not set XRESET to 1 until at least three cycles after XRESET 
goes inactive. This applies only to system reset. Setting XRESET to 0 does 
not change the contents of any of the serial-port control registers. It places 
the transmitter in a state corresponding to the beginning of a frame of data. 
Resetting the transmitter generates a transmit interrupt. Reset this bit dur- 
ing the time the mode of the transmitter is set. You can toggle XFSM with- 
out resetting the global-control register. 
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Table 8-3. Serial-Port Global-Control Register Bits Summary (Concluded) 



Bit Name Reset Value Function 



27 RRESET 


0 


Receive reset. If RRESET = 0, the receive side of the serial port is reset. 
To take the receive side of the serial port out of reset, set RRESET to 1 . 
Setting RRESET to 0 does not change the contents of any of the serial- 
port control registers. It places the receiver in a state corresponding to the 
beginning of a frame of data. Reset this bit at the same time that the mode 
of the receiver is set. RFSM can be toggled without resetting the global- 
control register. 


31-28 Reserved 


0-0 


Read as 0. 



Figure 8-10. Serial-Port Global-Control Register 

31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 

|xx | xx | xx | xx | RRESET| XRESEt| RINt] RTINT | XINT | XTINT | RLEN | XLEN | FSRP | FSXP | 







R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W R/W 


R/W 


R/W 


R/W 


15 


14 


13 


12 


11 


10 


9 


8 


7 


6 


5 


4 


3 


2 


1 


0 


DRP 


DXP 


CLKRP 


CLKXP 


RFSM 


XFSM 


RVAREN 


XVAREN 


RCLK 
SRCE 


XCLK 
SRCE 


HS 


RSR 
FULL 


XSR 
EMPTY 


FSXOUT 


XRDY 


RRDY 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R 


R 


R/W 


R 


R 



R = Read, W = Write, xx= reserved bit, read as 0 



8.2.2 FSX/DX/CLKX Port-Control Register 

This 32-bit port control register controls the function of the serial port FSX, DX, 
and CLKX pins. At reset, all bits are set to 0. Table 8-4 defines the register bits, 
bit names, and functions. Figure 8-11 shows this port control register. 



8-18 



Serial Ports 



Table 8-4. FSX/DX/CLKX Port-Control Register Bits Summary 



Bit Name 



Reset Value Function 



2 
3 
4 



6 
7 
8 



10 
11 



CLKXFUNC 

CLKXI/O 

CLKXDATOUT 

CLKXDATIN 

DXFUNC 

DXl/O 

DXDATOUT 

DXDATIN 

FSXFUNC 



9 FSX I/O 



FSXDATOUT 
FSXDATIN 



31-12 Reserved 
t x = 0or1 



0 CLKXFUNC controls the function of CLKX. If CLKXFUNC = 0, 

CLKX is configured as a general-purpose digital I/O port. If 
CLKXFUNC = 1 , CLKX is a serial port pin. 

0 If CLKX I/O = 0, CLKX is configured as a general-purpose input 
pin. If CLKX I/O = 1 , CLKX is configured as a general-purpose out- 
put pin. 

0 Data output on CLKX. 

x Data input on CLKX. A write has no effect. 

0 DXFUNC controls the function of DX. If DXFUNC = 0, DX is config- 

ured as a general-purpose digital I/O port. If DXFUNC = 1 , DX is 
a serial port pin. 

0 If DX I/O = 0, DX is configured as a general-purpose input pin. If 

DX 1/0 = 1, DX is configured as a general-purpose output pin. 

0 Data output on DX. 

xt Data input on DX. A write has no effect. 

0 FSXFUNC controls the function of FSX. If FSXFUNC = 0, FSX is 

configured as a general-purpose digital I/O port. If FSXFUNC = 1 , 
FSX is a serial port pin. 

0 If FSX J/O = 0, FSX is configured as a general-purpose input pin. 
If FSX I/O = 1 , FSX is configured as a general-purpose output pin. 

0 Data output on FSX. 

xt Data input on FSX. A write has no effect. 

0-0 Read as 0. 



Figure 8-11. FSX/DX/CLKX Port-Control Register 



31 


30 


29 


28 


27 


26 


25 


24 


23 


22 


21 


20 


19 


18 


17 


16 




XX 


« 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX | 


15 


14 


13 


12 


11 


10 


9 


8 


7 


6 


5 


4 


3 


2 


1 


0 


XX 


XX 


XX 


XX 


FSX 
DATIN 


FSX 
DATOUT 


FSX 
I/O 


FSX 
FUNC 


DX 
DATIN 


DX 
DATOUT 


DX 
I/O 


DX 
FUNC 


CLKX 
DATIN 


CLKX 
DATOUT 


CLKX 
I/O 


CLKX 
FUNC 










R 


R/W 


R/W 


R/W 


R 


R/W 


R/W 


R/W 


R 


R/W 


R/W 


R/W 



R = Read, W = Write, xx = reserved bit, read as 0 
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8.2.3 FSR/DR/CLKR Port-Control Register 



This 32-bit port control register is controlled by the function of the serial port 
FSR, DR, and CLKR pins. At reset, all bits are set to 0. Table 8-5 defines the 
register bits, the bit names, and functions. Figure 8-1 2 illustrates this port con- 
trol register. 



Table 8-5. FSR/DR/CLKR Port-Control Register Bits Summary 



Bit Name 



Reset Value Function 



2 
3 
4 



6 
7 
8 



CLKRFUNC 

CLKRI/O 

CLKRDATOUT 

CLKRDATIN 

DRFUNC 

DR I/O 

DRDATOUT 

DRDATIN 

FSRFUNC 



9 FSR I/O 



10 
11 



FSRDATOUT 
FSRDATIN 



31-12 Reserved 



0 CLKRFUNC controls the function of CLKR. If CLKRFUNC =0, 

CLKR is configured as a general-purpose digital I/O port. If 
CLKRFUNC = 1, CLKR is a serial port pin. 

0 If CLKRj/0 = 0, CLKR is configured as a general-purpose input pin. 

If CLKRi/O = 1 , CLKR is configured as a general-purpose output pin. 

0 Data output on CLKR. 

x Data input on CLKR. A write has no effect. 

0 DRFUNC controls the function of DR. If DRFUNC = 0, DR is 

configured as a general-purpose digital I/O port. If DRFUNC = 1 , DR 
is a serial port pin. 

0 If DR|/O=0, DR is configured as a general-purpose input pin. 

If DRI/0 = 1, DR is configured as a general-purpose output pin. 

0 Data output on DR 

x t Data input on DR. A write has no effect. 

0 FSRFUNC controls the function of FSR. If FSRFUNC = 0, FSR is 

configured as a general-purpose digital I/O port. If 
FSRFUNC = 1, FSR is a serial port pin. 

0 If FSR T/O = 0, FSR is configured as a general-purpose input pin. If 

FSR 1/0 = 1 , FSR is configured as a general-purpose output pin. 

0 Data output on FSR 

x Data input on FSR. A write has no effect. 

0-0 Read as 0. 



t x = 0 or 1 

Figure 8-12. FSR/DR/CLKR Port-Control Register 
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XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 
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15 


14 
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7 


6 
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2 


1 


0 


XX 


XX 


XX 


XX 


FSR 
DATIN 


FSR 
DATOUT 


FSR 
I/O 


FSR 
FUNC 


DR 
DATIN 


DR 
DATOUT 


DR 

I/O 


DR 
FUNC 


CLKR 
DATIN 


CLKR 
DATOUT 


CLKR 
I/O 


CLKR 
FUNC 










R 


R/W 


R/W 


R/W 


R 


R/W 


R/W 


R/W 


R 


R/W 


R/W 


R/W 



R = Read, W = Write, xx= reserved bit, read as 0 
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8.2.4 Receive/Transmit Timer-Control Register 

A 32-bit receive/transmit timer control register contains the control bits for the 
timer module. At reset, all bits are set to 0. Table 8-6 lists the register bits, bit 
names, and functions. Bits 5-0 control the transmitter timer. Bits 1 1 -6 control 
the receiver timer. Figure 8-13 shows the register. The serial port receive/ 
transmit timer function is similar to timer module operation. It can be consid- 
ered a 16-bit-wide timer. Refer to Section 8.1 on page 8-2 for more informa- 
tion on timers. 



Table 8-6. Receive/Transmit Timer-Control Register 



Bit 


Name 


Reset Value 


Function 


0 


XGO 


0 


The XGO bit resets and starts the transmit timer counter. When XGO 
is set to 1 and the timer is not held, the counter is zeroed and begins 
inrrpmpntinn on thp npyt rtainn peine* of thp timpr inm it Hook Thp XfiO 
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bit is cleared on the same rising edge. Writing 0 to XGO has no effect 
on the transmit timer. 


1 


XHLD 


0 


Transmit counter hold signal. When this bit is set to 0, the counter is dis- 
abled and held in its current state. The internal divide-by-two counter 
is also held so that the counter will continue where it left off when XHLD 
is set to 1 . You can read and modify the timer registers while the timer 
is being held. RESET has priority over XHLD. 


2 


XC/P 


0 


XCIock/Pulse mode control. When XC/P = 1 , the clock mode is chosen. 
The signaling of the status flag and external output has a 50 percent 
duty cycle. When XC/P = 0, the status flag and external output are ac- 
tive for one CLKOUT cycle during each timer period. 


3 


XCLKSRC 


0 


This bit specifies the source of the transmit timer clock. When 
XCLKSRC = 1 , an internal clock with frequency equal to one-half the 
CLKOUT frequency is used to increment the counter. When XCLKSRC 
= 0, you can use an external signal from the CLKX pin to increment the 
counter. The external clock source is synchronized internally, thus al- 
lowing for external asynchronous clock sources that do not exceed the 
specified maximum allowable external clock frequency, that is, less 
than f(H1)/2.6. 


4 


Reserved 


0 


Read as zero. 


5 


XTSTAT 


0 


This bit indicates the status of the transmit timer. It tracks what would 
be the output of the uninverted CLKX pin. This flag sets a CPU interrupt 
on a transition from 0 to 1 . A write has no effect. 


6 


RGO 


0 


The RGO bit resets and starts the receive timer counter. When RGO 
is set to 1 and the timer is not held, the counter is zeroed and begins 
incrementing on the next rising edge of the timer input clock. The RGO 
bit is cleared on the same rising edge. Writing 0 to RGO has no effect 
on the receive timer. 


7 


RHLD 


0 


Receive counter hold signal. When this bit is set to 0, the counter is dis- 
abled and held in its current state. The internal divide-by-two counter 
is also held so that the counter will continue where it left off when RHLD 
is set to 1 . You can read and modify the timer registers while the timer 
is being held. RESET has priority over RHLD. 
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Table 8-6. Receive/Transmit Timer-Control Register (Concluded) 





Bit 


Name 


Reset Value 


Function 




8 
9 


RC/P 

RCLKSRC 




0 
0 




RCIock/Pulse mode control. When RC/P = 1 , the clock mode is cho- 
sen. The signaling of the status flag and external output has a 50 per- 
cent duty cycle. When RC/P = 0, the status flag and external output 
are active for one CLKOUT cycle during each timer period. 

This bit specifies the source of the receive timer clock. When 

nULrxOnv = 1 , all II UolTlal CIOUK Willi Tl cLjUcllCy @C|Ual lO uno-llcUT 1116 

CLKOUT frequency is used to increment the counter. When 
RCLKSRC = 0, you can use an external signal from the CLKR pin to 
increment the counter. The external clock source is synchronized in- 
ternally, thus allowing for external asynchronous clock sources that 
qo nox exceea ine speciiiea maximum anowauie exiernai ciock ire- 
quency, that is, less than f(H1)/2.6. 




10 


Reserved 




0 




Read as zero. 




11 


RTSTAT 




0 




This bit indicates the status of the receive timer. It tracks what would 
be the output of the uninverted CLKR pin. This flag sets a CPU inter- 
rupt on a transition from 0 to 1 . A write has no effect. 


31 


—12 


Reserved 




0-0 




Read as 0. 


Figure 8-13. Receive/Transmit Timer-Control Register 
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Read, W= Write, xx = 


reserved bit, read as 0 



8.2.5 Receive/Transmit Timer-Counter Register 

The receive/transmit timer counter register is a 32-bit register (see 
Figure 8-1 4) . Bits 1 5-0 are the transmit timer counter, and bits 31 — 1 6 are the 
receive timer counter. Each counter is cleared to 0 whenever it increments to 
the value of the period register (see Section 8.2.6). It is also set to 0 at reset. 

Figure 8-14. Receive/Transmit Timer Counter Register 
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NOTE: All bits are read/write. 
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8.2.6 Receive/Transmit Timer-Period Register 



The receiveAransmit timer period register is a 32-bit register (see 
Figure 8-1 5). Bits 1 5-0 are the timer transmit period, and bits 31 -1 6 are the 
receive period. Each register is used to specify the period of the timer. It is also 
cleared to 0 at reset. 



Figure 8-75. Receive/Transmit Timer-Period Register 
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Note: Ail bits are read/write. 



8.2.7 Data-Transmit Register 



When the data-transmit register (DXR) is loaded, the transmitter loads the 
word into the transmit shift register (XSR), and the bits are shifted out. The 
delay from a write to DXR until an FSX occurs (or can be accepted) is two 
CLKX cycles. The word is not loaded into the shift register until the shifter is 
empty. When DXR is loaded into XSR, the XRDY bit is set, specifying that the 
buffer is available to receive the next word. Four tap points within the transmit 
shift register are used to transmit the word. These tap points correspond to the 
four data word sizes and are illustrated in Figure 8-1 6. The shift is a left-shift 
(LSB to MSB) with the data shifted out of the MSB corresponding to the appro- 
priate tap point. 



Figure 8-16. Transmit Buffer Shift Operation 
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8.2.8 Data-Receive Register 

When serial data is input, the receiver shifts the bits into the receive shift regis- 
ter (RSR). When the specified number of bits are shifted in, the data-receive 
register (DRR) is loaded from RSR, and the RRDY status bit is set. The receiv- 
er is double-buffered. If the DRR has not been read and the RSR is full, the 
receiver is frozen. New data coming into the DR pin is ignored. The receive 
shifter will not write over the DRR. The DRR must be read to allow new data 
in the RSR to be transferred to the DRR. When a write to DRR occurs at the 
same time that an RSR to DRR transfer takes place, the RSR to DRR transfer 
has priority. 

Data is shifted to the left (LSB to MSB). Figure 8-1 7 illustrates what happens 
when words less than 32 bits are shifted into the serial port. In this figure, it is 
assumed that an 8-bit word is being received and that the upper three bytes 
of the receive buffer are originally undefined. In the first portion of the figure, 
byte a has been shifted in. When byte b is shifted in, byte a is shifted to the left. 
When the data receive register is read, both bytes a and b are read. 



Figure 8-1 7. Receive Buffer Shift Operation 
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8.2.9 Serial-Port Operation Configurations 

Several configurations are provided for the operation of the serial port clocks 
and timer. The clocks for each serial port can originate either internally or exter- 
nally. Figure 8-18 shows serial port clocking in the I/O mode (CLKRFUNC = 
0) when CLKX is either an input or an output. Figure 8-19 shows clocking in 
the serial-port mode (CLKRFUNC=1). Both figures use a transmit section for 
an example. The same relationship holds for a receive section. 
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Figure 8-18. Serial-Port Clocking in I/O Mode 
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Figure 8-19. Serial-Port Clocking in Serial-Port Mode 
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8.2.10 Serial-Port Timing 

The formula for calculating the frequency of the serial-port clock with an inter- 
nally generated clock is dependent upon the operation mode of the serial-port 
timers, defined as 

f (pulse mode) = f (timer clock)/period register 

f (clock mode) = f (timer clock)/(2 x period register) 

An internally generated clock source f (timer clock) has a maximum frequency 
of f(H1)/2. An externally generated serial-port clock f (timer clock) (CLKX or 
CLKR) has a maximum frequency of less than f(H1)/2.6. See serial port timing 
in Table 13-27 on page 13-58. Also, see subsection 8.1 .3 on page 8-8 for in- 
formation on timer pulse/clock generation. 



8-26 



Serial Ports 



Transmit data is clocked out on the rising edge of the selected serial-port clock. 
Receive data is latched into the receive shift register on the falling edge of the 
serial-port clock. All data is transmitted and loaded MSB first and right-justi- 
fied. If fewer than 32 bits are transferred, the data are right-justified in the 32-bit 
transmit and receive buffers. Therefore, the LSBs of the transmit buffer are 
the bits that are transmitted. 

The transmit ready (XRDY) signal specifies that the data-transmit register 
(DXR) is available to be loaded with new data. XRDY goes active as soon as 
the data is loaded into the transmit shift register (XSR) . The last word may still 
be shifting out when XRDY goes active. If DXR is loaded before the last word 
has completed transmission, the data bits transmitted are consecutive; that is, 
the LSB of the first word immediately precedes the MSB of the second, with 
all signaling valid as in two separate transmits. XRDY goes inactive when DXR 
is loaded and remains inactive until the data is loaded into the shifter. 

The receive ready (RRDY) signal is active as long as a new word of data is 
loaded into the data receive register and has not been read. As soon as the 
data is read, the RRDY bit is turned off. 

When FSX is specified as an output, the activity of the signal is determined 
solely by the internal state of the serial port. If afixed data rate is specified, FSX 
goes active when DXR is loaded into XSR to be transmitted out. One serial- 
clock cycle later, FSX turns inactive, and data transmission begins. If a variable 
data rate is specified, the FSX pin is activated when the data transmission be- 
gins and remains active during the entire transmission of the word. Again, the 
data is transmitted one clock cycle after it is loaded into the data transmit 
register. 

An input FSX in the fixed data rate mode should go active for at least one serial 
clock cycle and then inactive to initiate the data transfer. The transmitter then 
sends the number of bits specified by the LEN bits. In the variable data-rate 
mode, the transmitter begins sending from the time FSX goes active until the 
number of specified bits has been shifted out. In the variable data-rate mode, 
when the FSX status changes prior to all the data bits being shifted out, the 
transmission completes, and the DX pin is placed in a high-impedance state. 
An FSR input is exactly complementary to the FSX. 

When using an external FSX, if DXR and XSR are empty, a write to DXR results 
in a DXR-to-XSR transfer. This data is held in the XSR until an FSX occurs. 
When the external FSX is received, the XSR begins shifting the data. If XSR 
is waiting for the external FSX, a write to DXR will change DXR, but a DXR-to- 
XSR transfer will not occur. XSR begins shifting when the external FSX is re- 
ceived, or when it is reset using XRESET. 
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Continuous Transmit and Receive Modes 

When continuous mode is chosen, consecutive writes do not generate or ex- 
pect new sync pulse signaling. Only the first word of a block begins with an ac- 
tive synchronization. Thereafter, data continues to be transmitted as long as 
new data is loaded into DXR before the last word has been transmitted. As 
soon as TXRDY is active and all of the data has been transmitted out of the 
shift register, the DX pin is placed in a high-impedance state, and a subsequent 
write to DXR initiates a new block and a new FSX. 

Similarly with FSR, the receiver continues shifting in new data and loading 
DRR. If the data-receive buffer is not read before the next word is shifted in, 
you will lose subsequent incoming data. You can use the RFSM bit to terminate 
the receive-continuous mode. 

Handshake Mode 

The handshake mode (HS = 1) allows for direct connection between proces- 
sors. In this mode, all data words are transmitted with a leading 1 (see 
Figure 8-20). For example, if an eight-bit word is to be transmitted, the first bit 
sent is a 1 , followed by the eight-bit data word. 

In this mode, once the serial port transmits a word, it will not transmit another 
word until it receives a separately transmitted zero bit. Therefore, the 1 bit that 
precedes every data word is, in effect, a request bit. 

Figure 8-20. Data Word Format in Handshake Mode 

|* Data Word (8 Bits) *\ 




After a serial port receives a word (with the leading 1 ) and that word has been 
read from the DRR, the receiving serial port sends a single 0 to the transmitting 
serial port. Thus, the single 0 bit acts as an acknowledge bit (see Figure 8-21 ) . 
This single acknowledge bit is sent every time the DRR is read, even if the DRR 
does not contain new data. 

Figure 8-21. Single Zero Sent as an Acknowledge Bit 
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When the serial port is placed in the handshake mode, the insertion and dele- 
tion of a leading 1 for transmitted data, the sending of a 0 for acknowledgement 
of received data, and the waiting for this acknowledge bit are all performed au- 
tomatically. Using this scheme, it is simple to connect processors with no exter- 
nal hardware and to guarantee secure communication. Figure 8-22 is a typi- 
cal configuration. 

In the handshake mode, FSX is automatically configured as an output. Contin- 
uous mode is automatically disabled. After a system reset or XRESET, the 
transmitter is always permitted to transmit. The transmitter and receiver must 
be reset when entering the handshake mode. 

Figure 8-22. Direct Connection Using Handshake Mode 
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8.2.11 Serial-Port Interrupt Sources 

A serial port has the following interrupt sources: 

□ The transmit timer interrupt: The rising edge of XTSTAT causes a sing- 
le-cycle interrupt pulse to occur. When XTINT is 0, this interrupt pulse is 
disabled. 

□ The receive timer interrupt: The rising edge of RTSTAT causes a single- 
cycle interrupt pulse to occur. When RTINT is 0, this interrupt pulse is dis- 
abled. 

□ The transmitter interrupt: Occurs immediately following a DXR-to-XSR 
transfer. The transmitter interrupt is a single-cycle pulse. When the 
serial-port global-control register bit XI NT is 0, this interrupt pulse is dis- 
abled. 

□ The receiver interrupt: Occurs immediately following an RSR to DRR 
transfer. The receiver interrupt is a single-cycle pulse. When the 
serial-port global-control register bit RINT is 0, this interrupt pulse is 
disabled. 

The transmit timer interrupt pulse is ORed with the transmitter interrupt pulse to create 
the CPU transmit interrupt flag XI NT. The receive timer interrupt pulse is ORed with the 
receiver interrupt pulse to create the CPU receive interrupt flag RINT 
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8.2.12 Serial-Port Functional Operation 

The following paragraphs and figures illustrate the functional timing of the vari- 
ous serial-port modes of operation. The timing descriptions are presented with 
the assumption that all signal polarities are configured to be positive, that is, 
CLKXP = CLKRP = DXP = DRP = FSXP = FSRP = 0. Logical timing, in situa- 
tions where one or more of these polarities are inverted, is the same except 
with respect to the opposite polarity reference points, that is, rising vs. falling 
edges, etc. 

These discussions pertain to the numerous operating modes and configura- 
tions of the serial-port logic. When it is necessary to switch operating modes 
or change configurations of the serial port, you should do so only when 
XRESET or RRESET are asserted ( low), as a ppropriate. Therefore, when 
transmit configurations are modifie d, XRESE T should be low, and when re- 
ceive configurations are modified, RRESET should be low. When you use 
handshake mode, however, since the transmitter and receiver are interrelated, 
you should make any configuration changes with XRESET and RRESET both 
low. 

All of the serial-port operating configurations can be broadly classified in two 
categories: fixed data-rate timing and variable data-rate timing. The following 
paragraphs discuss fixed and variable data-rate operation and all of their vari- 
ations. 

Fixed Data-Rate Timing Operation 

Fixed data-rate serial-port transfers can occur in two varieties: burst mode and 
continuous mode. In burst mode, transfers of single words are separated by 
periods of inactivity on the serial port. In continuous mode, there are no gaps 
between successive word transfers; the first bit of a new word is transferred 
on the next CLKX/R pulse following the last bit of the previous word. This oc- 
curs continuously until the process is terminated. 

In burst mode with fixed data-rate timing, FSX/FSR pulses initiate transfers, 
and each transfer involves a single word. With an internally generated FSX 
(see Figure 8-23), transmission is initiated by loading DXR. In this mode, 
there is a delay of approximately 2.5 CLKX cycles (depending on CLKX and 
H1 frequencies) from the time DXR is loaded until FSX occurs. With an exter- 
nal FSX, the FSX pulse initiates the transfer, and the 2.5-cycie delay effectively 
becomes a setup requirement for loading DXR with respect to FSX. Therefore, 
in this case, you must load DXR no later than three CLKX cycles before FSX 
occurs. Once the XSR is loaded from the DXR, an XINT is generated. 
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Figure 8-23. Fixed Burst Mode 
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In receive operations, once a transfer is initiated, FSR is ignored until the last 
bit. For burst-mode transfers, FSR must be low during the last bit, or another 
transfer will be initiated. After a full word has been received and transferred to 
the DRR, an RINT is generated. 

In fixed data-rate mode, you can perform continuous transfers even if R/XFSM 
= 0, as long as properly timed frame synchronization is provided, or as long 
as DXR is reloaded each cycle with an internally generated FSX (see 
Figure 8-24). 



Figure 8-24. Fixed Continuous Mode With Frame Sync 
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For receive operations and with externally generated FSX, once transfers 
have begun, frame sync pulses are required only during the last bit transferred 
to initiate another contiguous transfer. Otherwise, frame sync inputs are ig- 
nored. Therefore, continuous transfers will occur if frame sync is held high. 
With an internally generated FSX, there is a delay of approximately 2.5 CLKX 
cycles from the time DXR is loaded until FSX occurs. This delay occurs each 
time DXR is loaded; therefore, during continuous transmission, the instruction 
that loads DXR must be executed by the A/-3 bit for an /V-bit transmission. 
Since delays due to pipelining may vary, you should incorporate a conserva- 
tive margin of safety in allowing for this delay. 

Once the process begins, an XI NT and an RINT are generated at the begin- 
ning of each transfer. The XI NT indicates that the XSR has been loaded from 
DXR and can be used to cause DXR to be reloaded. To maintain continuous 
transmission in fixed rate mode with frame sync, especially with an internally 
generated FSX, DXR must be reloaded early in the ongoing transfer. 

The RINT indicates that a full word has been received and transferred into the 
DRR. RINT is therefore commonly used to indicate an appropriate time to read 
DRR. 

Continuous transfers are terminated by discontinuing frame sync pulses or, in 
the case of internally generated FSX, not reloading DXR. 

You can accomplish continuous serial-port transfers without the use of frame 
sync pulses if R/XFSM are set to 1 . In this mode, operation of the serial port 
is similar to continuous operation with frame sync, except that a frame sync 
pulse is involved only in the first word transferred, and no further frame sync 
pulses are used. Following the first word transferred (see Figure 8-25), no in- 
ternal frame sync pulses are generated, and frame sync inputs are ignored. 
Additionally, you should set R/XFSM prior to or during the first word trans- 
ferred; you must set R/XFSM no later than the transfer of the AM bit of the first 
word, except for transmit operations. For transmit operations in the fixed data- 
rate mode, XFSM must be set no later than the N-2 bit. You must clear 
R/XFSM no later than the AM bit to be recognized in the current cycle. 
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Figure 8-25. Fixed Continuous Mode Without Frame Sync 
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Timing of RINT and XINT and data transfers to and from DXR and DRR, re- 
spectively, are the same as in fixed data-rate continuous mode with frame 
sync. This mode of operation also exhibits the same delay of 2.5 CLKX cycles 
after DXR is loaded before an internal FSX is generated. As in the case of con- 
tinuous operation in fixed data-rate mode with frame sync, you must reload 
DXR no later than transmission of the A/-3 bit. 

When you use continuous operation in fixed data-rate mode, R/XFSM can be 
set and cleared as desired, even during active transfers, to enable or disable 
the use of frame sync pulses as dictated by system requirements. Under most 
conditions, the effect of changing the state of R/XFSM occurs during the trans- 
fer in which the R/XFSM change was made, provided the change was made 
early enough in the transfer. For transmit operations with internal FSX in fixed 
data-rate mode, however, a one-word delay occurs before frame sync pulse 
generation resumes when clearing XFSM to 0 (see Figure 8-26). Therefore, 
in this case, one additional word is transferred before the next FSX pulse is 
generated. Also note that, as discussed previously, the clearing of XFSM is 
recognized during the transmission of the word currently being transmitted as 
long as XFSM is cleared no later than the AM bit. The setting of XFSM is rec- 
ognized as long as XFSM is set no later than the N-2 bit. 
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Figure 8-26. Exiting Fixed Continuous Mode Without Frame Sync, FSX Internal 
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Variable Data-Rate Timing Operation 

Variable data-rate timing also supports operation in either burst or continuous 
mode. Burst-mode operation with variable data-rate timing is similar to burst- 
mode operation with fixed data-rate timing. With variable data-rate timing (see 
Figure 8-27), however, FSX/R and data timing differ slightly at the beginning 
and end of transfers. Specifically, there are three major differences between 
fixed and variable data-rate timing: 

□ FSX/R pulses typically last for the entire transfer interval, although FSR 
and external FSX are ignored after the first bit transferred. FSX/R pulses 
in fixed data-rate mode typically last only one CLKX/R cycle but can last 
longer. 

□ Data transfer begins during the CLKX/R cycle in which FSX/R occurs, 
rather than the CLKX/R cycle following FSX/R, as is the case with fixed 
data-rate timing. 

□ With variable data-rate timing, frame sync inputs are ignored until the end 
of the last bit transferred, rather than the beginning of the last bit trans- 
ferred, as is the case with fixed data-rate timing. 
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Figure 8-27. Variable Burst Mode 
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When you transmit continuously in variable data-rate mode with frame sync, 
timing is the same as for fixed data-rate mode, except for the differences be- 
tween these two modes as described under Variable Data-Rate Timing Opera- 
tion. The only other exception is that you must reload DXR no later than the 
AM bit to maintain continuous operation of the variable data-rate mode (see 
Figure 8-28); you must reload DXR no later than the N-3 bit to maintain con- 
tinuous operation of the fixed data-rate mode. 



Figure 8-28. Variable Continuous Mode With Frame Sync 
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Continuous operation in variable data-rate mode without frame sync (see 
Figure 8-29) is also similar to continuous operation without frame sync in fixed 
data-rate mode. As with variable data-rate mode continuous operation with 
frame sync, you must reload DXR no later than the AM bit to maintain continu- 
ous operation. Additionally, when R/XFSM is set or cleared in the variable da- 
ta-rate mode, you must make the modification no later than the AM bit for the 
result to be affected in the current transfer. 
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Figure 8-29. Variable Continuous Mode Without Frame Sync 
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8.2.13 Serial-Port Initialization/Reconfiguration 

The serial ports are controlled through memory-mapped registers on the dedi- 
cated peripheral bus. Following is a general procedure for initializing and/or 
reconfiguring the serial ports. 



1 ) Halt the serial port by clearing the XRESET and/or RRESET bits of the ser- 
ial-port global-control register. To do this, write a 0 to the serial-port global- 
control register. Note that the serial ports are halted on RESET. 

2) Configure the serial port via the serial-port global-control register (with 
XRESET = RRESET = 0) and the FSX/DX/CLKX and FSR/DR/CLKR port- 
control registers. If necessary, configure the receiveAransmit registers: 
timer control (with XHLD = RHLD = 0), timer counter, and timer period. Re- 
fer to subsection 8.2.14 for more information. 



3) Start the serial port operation by setting the XR ESET and RRESE T bits 
of the serial-port global-control register and the XHLD and RHLD bits of 
the serial-port receiveAransmit timer-control register, if necessary. 



8.2.14 TMS320C3x Serial-Port Interface Examples 

In addition to the examples presented in this section, DMA/serial port initializa- 
tion examples can be found in Example 8-6 and Example 8-7 on pages 8-59 
and 8-61 , respectively. 
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8.2. 14. 1 Handshake Mode Example 



When handshake mode is used, the transmit (FSX/DS/CLKX) and receive 
(FSR/DR/CLKR) signals transmit and receive data, respectively. In other 
words, even if the TMS320C3x serial port is receiving data only with hand- 
shake mode, the transmit signals are still needed to transmit the acknowledge 
signal. This is the serial port register setup for the TMS320C3x serial port 
handshake communication, as shown in Figure 8-22 on page 8-29: 

Global control = 011 x0x0xxxx00000000xx01 1 001 00b 

Transmit port control = 0111 h 

Receive port control = 0111 h 

S_port timer control = OFh 

S_port timer count = Oh 

S_port timer period * 01 h (if two C3xs have the same 

system clock) 

x - user-configurable 



Since the FSX is set as an output and continuous mode is disabled when hand- 
shake mode is selected, you should set the XFSM and RFSM bits to 0 and the 
FSXOUT bit to 1 in the global control register. You should set the XRESET, 
RRESET, and HS bits to 1 in order to start the handshake communication. You 
should set the polarity of the serial port pins active (high) for simplification. Al- 
though the CLKX/CLKR can be set as either input or output, you should set 
the CLKX as output and the CLKR as input. The rest of the bits are user-confi- 
gurable as long as both serial ports have consistent setup. 

You need the serial port timer only if the CLKX or CLKR is configured as an 
output. Since only the CLKX is configured as an output, you should set the tim- 
er control register to OFh. When the serial port timer is used, you should also 
set the serial timer register to the proper value for the clock speed. The serial 
port timer clock speed setup is similar to the TMS320C3x timer. Refer to Sec- 
tion 8.1 on page 8-2 for detailed information on timer clock generation. 

The maximum clock frequency for serial transfers is F(CLKIN)/4 if the internal 
clock is used and F(CLKIN)/5.2 if an external clock is used. Therefore, if two 
TMS320C3xs have the same system clock, the timer period register should 
be set equal to or greater than 1 , which makes the clock frequency equal to 
F(CLKIN)/8. 

Example 8-1 and Example 8-2 are serial port register setups for the above 
case. (Assume two TMS320C3xs have the same system clock.) 
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Example 8-1. Serial-Port Register Setup #1 

Global control « 0EBC0064h; 32 bits, fixed data rate, burst mode, 

Transmit port control - Olllh ; FSX (output), CLKX (output) - F(CLKIN)/8 

Receive port control = Olllh ; CLKR (input), handshake mode, transmit 

S__port timer control « OFh ; and receive interrupt is enabled. 

S_j?ort timer count = Oh 

S_port timer period Olh 

Example 8-2. Serial-Port Register Setup #2 

Global control = 0C000364h; 8 bits, variable data rate, burst mode, 

Transmit port control * Olllh; FSX (output), CLKX (output) « f (CLKIN)/24 

Receive port control = Olllh ; CLKR (input), handshake mode, transmit 

S__port timer control » OFh ; and receive interrupt is disabled. 

S_port timer count = Oh 

S_port timer period 2 Olh 

Since the data has a leading 1 and the acknowledge signal is a 0 in the hand- 
shake mode, the TMS320C3x serial port can distinguish between the data and 
the acknowledge signal. Therefore, even if the TMS320C3x serial port re- 
ceives the data before the acknowledge signal, the data will not be misinter- 
preted as the acknowledge signal and be lost. In addition, the acknowledge 
signal is not generated until the data is read from the data receive register 
(DRR). Therefore, the TMS320C3x will not transmit the data and the acknowl- 
edge signal simultaneously. 

8.2. 14.2 CPU Transfer With Serial-Port Transmit Polling Method 

Example 8-3 sets up the CPU to transfer data (1 28 words) from an array buffer 
to the serial port 0 output register when the previous value stored in the serial 
port output register has been sent. Serial port 0 is initialized to transmit 32-bit 
data words with an internally generated frame sync and a bit-transfer rate of 
8H1 cycles/bit. 
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Example 8-3. CPU Transfer With Serial-Port Transmit Polling Method 



* TITLE: 



SOURCE 



CPU TRANSFER WITH SERIAL-PORT TRANSMIT POLLING METHOD 



.GLOBAL START 
.DATA 

.WORD _ARRAY 
.BSS _ARRAY,128 



SPORT 

SPRESET 

SGCCTRL 

SXCTRL 

STCTRL 

STPERIOD 

RESET 

START 



.WORD 808040H 

.WORD 008C0044 

.WORD 048C0044H 

.WORD 111H 

.WORD OOFH 

.WORD 00000002h 

.WORD OH 

.TEXT 
LDP RESET 

ANDN 10H,IE 



SERIAL PORT INITIALIZATION 
LDI @ SPORT, ARl 
LDI @ RESET, RO 
LDI 4,IR0 
STI R0,*+AR1(IR0) 
LDI § SPRESET, RO 
STI R0,*AR1 
LDI § SXCTRL, RO 
STI R0,*+AR1(3) 
LDI § STPERIOD, RO 
STI R0,*+AR1(6) 
LDI 8 STCTRL, RO 
STI R0,*+AR1(4) 
LDI § SGCCTRL, RO 
STI R0,*AR1 



DATA ARRAY LOCATED IN .BSS SECTION 
THE UNDERSCORE USED IS JUST TO MAKE IT 
ACCESSIBLE FROM C (OPTIONAL) 
SERIAL-PORT GLOBAL CONTROL REG ADDRESS 
SERIAL-PORT RESET 

SERIAL-PORT GLOBAL CONTROL REG INITIALIZATION 
SERIAL-PORT TX PORT CONTROL REG INITIALIZATION 
SERIAL-PORT TIMER CONTROL REG INITIALIZATION 
SERIAL-PORT TIMER PERIOD 
SERIAL-PORT TIMER RESET VALUE 

LOAD DATA PAGE POINTER 

DISABLE SERIAL-PORT TRANSMIT INTERRUPT TO CPU 



SERIAL-PORT TIMER RESET 
SERIAL-PORT RESET 

SERIAL-PORT TX CONTROL REG INITIALIZATON 
SERIAL-PORT TIMER PERIOD INITIALIZATION 
SERIAL-PORT TIMER CONTROL REG INITIALIZATION 
SERIAL-PORT GLOBAL CONTROL REG INITIALIZATION 



* CPU WRITES THE FIRST WORD 



LDI § SOURCE, ARO 
LDI *AR0++,R1 
STI R1,*+AR1(8) 



* CPU WRITES 127 WORDS TO THE SERIAL PORT OUTPUT REG 



LDI 8,IR0 
LDI 2,R0 
LDI 126,RC 
RPTB LOOP 

WAIT AND *AR1,R0,R2 ; WAIT UNTIL XRDY BIT - 1 

BZ WAIT 
LOOP STI R1,*+AR1(IR0) 
| | LDI *++AR0(l) ,R1 

BU $ 

• END 
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8.2.14.3 Serial AIC Interface Example 

The TLC320C4x analog interface chips (AIC) from Texas Instruments offer a 
zero-glue-logic interface to the TMS320C3x family of DSPs. The interface is 
shown in Figure 8-30 as an example of the TMS320C3x serial-port configura- 
tion and operation. 



Figure 8-30. TMS320C3x Zero-Glue-Logic Interface to TLC3204X Example 
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The TMS320C3x resets the AIC through the external pin XFO. It also gener- 
ates the master clock for the AIC through the timer 0 output pin, TCLKO. (Pre- 
cise selection of a sample rate may require the use of an external oscillator 
rather than the TCLKO output to drive the AIC MCLK input.) In turn, the AIC 
generates the CLKRO and CLKXO shift clocks as well as the FSRO and FSXO 
frame synchronization signals. 

A typical use of the AIC requires an 8-kHz sample rate of the analog signal. 
If the clock input frequency to the TMS320C3x device is 30 MHz, you should 
load the following values into the serial port and timer registers. 

Serial Port: 

Port global control register: 0E970300h 

FSX/DX/CLKX port control register 000001 1 1 h 

FSR/DR/CLKR port control register 000001 1 1 h 

Timer: 

Timer global control register 000002C1 h 

Timer period register 00000001 h 



8.2.14.4 Serial A/D and D/A Interface Example 

The DSP201/2 and DSP1 01/2 family of D/As and A/Ds from Burr Brown also 
offer a zero-glue-logic interface to the TMS320C3x family of DSPs. The inter- 
face is shown in Example 8-4. This interface is used as an example of the 
TMS320C3x serial-port configuration and operation. 
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Example 8-4. TMS320C3x Zero-Glue-Logic Interface to Burr Brown A/D and D/A 



Burr Brown DSP102 A/D 



Burr Brown DSP202 D/A 



± 2.75 V - 
± 2.75 V - 



22 pF 



1 MOhm 
12.29 MHz 

HOI— 



VINA 
VINB 

OSCO 
OSC1 



CASC 

XCLK 

SOUTA 
SYNC 



CONV 



■+5V 



+5V- 



TMS320C3X 



4 + ► 



SSF — +5V 



CLKRO CLKXO 
DRO DXO 
FSRO 

FSXO 



TCLKO 



+5V- 
+5V- 



CASC 

XCLK 

SINA 
SINB 
SYNC 



VOUTA 



VOUTB 



SSF 
SWL 
CONV 



22 pF 



-> ±3V 



-> ±3V 



The DSP1 02 A/D is interfaced to the TMS320C3x serial port receive side; the 
DSP202 D/A is interfaced to the transmit side. The A/Ds and D/As are hard- 
wired to run in cascade mode. In this mode, when the TMS320C3x initiates a 
convert command to the A/D via the TCLKO pin, both analog inputs are con- 
verted into two 1 6-bit words, which are concatenated to form one 32-bit word. 
The A/D signals the TMS320C3x via the A/D's SYNC signal (connected to the 
TMS320C3x FSRO pin) that serial data is to be transmitted. The 32-bit word 
is then serially transmitted, MSB first, out the SOUTA serial pin of the DSP102 
to the DRO pin of the TMS320C3x serial port. The TMS320C3x is programmed 
to drive the analog interface bit clock from the CLKXO pin of the TMS320C3x. 
The bit clock drives both the A/D's and D/A's XCLK input. The TMS320C3x 
transmit clock also acts as the input clock on the receive side of the 
TMS320C3x serial port. Since the receive clock is synchronous to the internal 
clock of the TMS320C3x, the receive clock can run at full speed (that is, 
f(H1)/2). 
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Similarly, on receiving a convert command, the pipelined D/A converts the last 
word received from the TMS320C3x and signals the TMS320C3x via the 
SYNC signal (connected to the TMS320C3x FSXO pin) to begin transmitting 
a 32-bit word representing the two channels of data to be converted. The data 
transmitted from the TMS320C3x DXO pin is input to both the SINA and SINB 
inputs of the D/A as shown in the figure. 

The TMS320C3x is set up to transfer bits at the maximum rate of about eight 
Mbps, with a dual-channel sample rate of about 44. 1 kHz. Assuming a 32-MHz 
CLKIN, you can configure this standard-mode fixed-data-rate signaling inter- 
face by setting the registers as described below: 

Serial Port: 

Port global-control register: 0EBC0040h 

FSX/DX/CLKX port-control register 000001 1 1 h 

FSR/DR/CLKR port-control register 000001 1 1 h 

ReceiveAransmit timer-control register OOOOOOOFh 



Timer: 

Timer global-control register 
Timer period register 



000002C1h 
000000B5h 
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8.3 DMA Controller 

The TMS320C3x has an on-chip direct memory access (DMA) controller that 
reduces the need for the CPU to perform input/output functions. The DMA con- 
troller can perform input/output operations without interfering with the opera- 
tion of the CPU. Therefore, it is possible to interface the TMS320C3x to slow 
external memories and peripherals (A/Ds, serial ports, etc.) without reducing 
the computational throughput of the CPU. The result is improved system per- 
formance and decreased system cost. 

A DMA transfer consists of two operations: a read from a memory location and 
a write to a memory location. The DMA controller can read from and write to 
any location in the TMS320C3x memory map. This includes all 
memory-mapped peripherals. The operation of the DMA is controlled with the 
following set of memory-mapped registers: 

□ DMA global-control register 

□ DMA source-address register 

□ DMA destination-address register 

□ DMA transfer-counter register 

Table 8-7 shows these registers, their memory-mapped addresses, and their 
functions. Each of these DMA registers is discussed in the succeeding subsec- 
tions. 
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Table 8-7. Memory-Mapped Locations for a DMA Channel 





Peripheral 


Register 


Address 


DMA Global Control (See Table 8-6) 


808000b 


Reserved 


808001 h 


Reserved 


808002b 


Reserved 


808003h 


DMA Source Address (see subsection 8.3.2) 


808004h 


Reserved 


808005b 


DMA Destination Address (see subsection 8.3.2) 


808006b 


Reserved 


808007h 


DMA Transfer Counter (see subsection 8.3.3) 


808008b 


Reserved 


808009b 


Reserved 


80800Ab 


Reserved 


80800Bh 


Reserved 


80800Ch 


Reserved 


80800Dh 


Reserved 


80800Eh 


Reserved 


80800Fh 



8-44 



DMA Controller 



Table 8-8. DMA Global-Control Register Bits 



Bit Name Reset Value Function 



1-0 


START 


0-0 


These bits control the state in which the DMA starts and stops. The 
uma may D6 sioppea wnnoui any loss ot aaia (see iauie o— yj. 


3-2 


STAT 


0-0 


These bits indicate the status of the DMA and change every cycle 
(see lauieo— iuj. 


4 


INCSRC 


0 


If INCSRC = 1 , the source address is incremented after every read. 


5 


DECSRC 


0 


If DECSRC = 1 the source address is decremented after everv 

1 1 hr IbWV/I IW ^ 1 | lil lw WVVII WW WiWWl WWW ■ W MV VI Wl ■ 1 Wl l%WW> Wll »Wl W V Wl J 

read. If INCSRC = DECSRC, the source address is not modified 
after a read. 


6 


INCDST 


0 


If INCDST = 1 the destination address is incremented after everv 

II 1 1 « Vl/w 1 1 | LI 1 W VlVWill Iwill Wl 1 MMMI www IW II 1 Wl Wl 1 1 Wl IIWVI Wil Iwl W V wl V 

write. 


7 


DECDST 


0 


If DECDST = 1 , the destination address is decremented after every 
write. If INCDST = DECDST, the destination address is not modified 
after a write. 


9-8 


SYNC 


0-0 


The SYNC bits determine the timing synchronization between the 
events initiating the source and the destination transfers. The inter- 
pretation of the SYNC bits is shown in Table 8—1 1 . 


10 


TC 


0 


The TC bit affects the operation of the transfer counter. If TC = 0, 
transfers are not terminated when the transfer counter becomes 0. 
If TC = 1 , transfers are terminated when the transfer counter be- 
comes 0. 


11 


TCI NT 


0 


If TCI NT = 1 , the DMA interrupt is set when the transfer counter 
makes a transition to 0. If TCI NT = 0, the DMA interrupt is not set 
when the transfer counter makes a transition to 0. 


31-12 


Reserved 


0-0 


Read as 0. 



Note: When the DMA completes a transfer, the START bits remain in 1 1 (base 2) . The DMA starts when the START bits are set 
to 11 and one of the following conditions applies: 

□ The transfer counter is set to a value different from 0x0, or 

□ The TC bit is set to 0. 
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Table 8-9. START Bits and Operation of the DMA (Bits 0-1) 



START Function 



0 0 DMA read or write cycles in progress will be completed; any data read will 

be ignored. Any pending read or write will be cancelled. The DMA is reset 
so that when it starts a new transaction begins; that is, a read is per- 
formed. (Reset value) 

01 If a read or write has begun, it is completed before it stops. If a read or 
write has not begun, no read or write is started. 

10 If a DMA transfer has begun, the entire transfer is completed (including 
both read and write operations) before stopping. If a transfer has not be- 
gun, none is started. 

1 1 DMA starts from reset or restarts from the previous state. 



Table 8-10. STAT Bits and Status of the DMA (Bits 2-3) 



STAT 


Function 


00 


DMA is being held between DMA transfer (between a write and read). 




This is the value at reset. (Reset value) 


01 


DMA is being held in the middle of a DMA transfer, that is, between a read 




and a write. 


1 0 


Reserved. 


1 1 


DMA busy; that is, DMA is performing a read or write or waiting for a 




source or destination synchronization interrupt. 



Table 8-11. SYNC Bits and Synchronization of the DMA (Bits 8-9) 



SYNC 


Function 


00 


No synchronization. Enabled interrupts are ignored. (Reset value) 


01 


Source synchronization. A read is performed when an enabled interrupt 




occurs. 


1 0 


Destination synchronization. A write is performed when an enabled inter- 




rupt occurs. 


1 1 


Source and destination synchronization. A read is performed when an 




enabled interrupt occurs. A write is then performed when the next en- 




abled interrupt occurs. 
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8.3.1 DMA Global-Control Register 

The global-control register controls the state in which the DMA controller oper- 
ates. This register also indicates the status of the DMA, which changes every 
cycle. Source and destination addresses can be incremented, decremented, 
or synchronized using specified global-control register bits. At system reset, 
all bits in the DMA control register are cleared to 0. Table 8-8 on page 8-45 
lists the register bits, names, and functions. Figure 8-31 shows the bit config- 
uration of the global-control register. 



Figure 8-31. DMA Global-Control Register 
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TCINT 


TC 


SYNC 


DECDST 


INCDST 


DECSRC 


INCSRC 


STAT 


START 
















R/W 


R/W 


R/W R/W 


R/W 




R/W 


R/W 


R/W 




R 


R 


R/W R/W 



R = Read, W = Write, xx= reserved bit, read as 0 

8.3.2 Destination- and Source-Address Registers 

The DMA destination-and-source address registers are 24-bit registers whose 
contents specify destination and source addresses. As specified by control 
bits DECSRC, INCSRC, DECDST, and INCDST of the DMA global-control 
register, these registers are incremented and decremented at the end of the 
corresponding memory access, that is, the source register for a read and the 
destination register for a write. On system reset, 0 is written to these registers. 

8.3.3 Transfer-Counter Register 

The transfer-counter register is a 24-bit register, controlled by a 24-bit counter 
that counts down. The counter decrements at the beginning of a DMA memory 
write. In this way, it can control the size of a block of data transferred. The trans- 
fer counter register is set to 0 at system reset. When the TCINT bit of DMA 
global-control register is set, the transfer-counter register will cause a DMA in- 
terrupt flag to be set upon count down to 0. 

8.3.4 CPU/DMA Interrupt-Enable Register 

The CPU/DMA interrupt enable register (IE) is a 32-bit register located in the 
CPU register file. The CPU interrupt enable bits are in locations 10-1. The 
DMA interrupt-enable bits are in locations 26-1 6. A 1 in a CPU/DMA interrupt- 
enable register bit enables the corresponding interrupt. A 0 disables the corre- 
sponding interrupt. At reset, 0 is written to this register. 
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Table &-1 2 lists the bits, names, and functions of the CPU/DMA interrupt en- 
able register. Figure &-32 shows the IE register. The priority and decoding 
schemes of CPU and DMA interrupts are identical. Note that when the DMA 
receives an interrupt, this interrupt is acted upon according to the SYNC field 
of the DMA control register. Also note that an interrupt can affect the DMA but 
not the CPU and can affect the CPU but not the DMA. Refer to subsection 3.1 .8 
on page 3-7 and to Chapter 6. 

Table 8-12. CPU/DMA Interrupt-Enable Register Bits 



Bit 


hi Am a 


Function 

■ UIIVllVJIl 


o 


EINTO 


Fnahle external interrupt 0 fCPLh 


1 

1 


FINT1 


Fnahlfi pytArnal intorrunt 1 ff^PU^ 


o 

c 


FINT2 


Fnflhlft AYtArnAl intern int 9 ffiPl \\ 


3 


EINT3 


Enable external interruot 3 fCPU^ 


4 


EXINTO 


Enable serial-port 0 transmit interrupt (CPU) 


5 


ERINTO 


Enable serial-port 0 receive interrupt (CPU) 


6 


EXINT1 


Enable serial-port 1 transmit interrupt (CPU) 


7 


ERINT1 


Enable serial-port 1 receive interrupt (CPU) 


8 


ETINT0 


Enable timer 0 interrupt (CPU) 


9 


ETINT1 


Enable timer 1 interrupt (CPU) 


10 


EDINT 


Enable DMA controller interrupt (CPU) 


15-11 


Reserved 


Read as 0 


16 


EINTO 


Enable external interrupt 0 (DMA) 


17 


EINT1 


Enable external interrupt 1 (DMA) 


18 


EINT2 


Enable external interrupt 2 (DMA) 


19 


EINT3 


Enable external interrupt 3 (DMA) 


20 


EXINTO 


Enable serial-port 0 transmit interrupt (DMA) 


21 


ERINTO 


Enable serial-port 0 receive interrupt (DMA) 


22 


EXINT1 


Enable serial-port 1 transmit interrupt (DMA) 


23 


ERINT1 


Enable serial-port 1 receive interrupt (DMA) 


24 


ETINT0 


Enable timer 0 interrupt (DMA) 


25 


ETINT1 


Enable timer 1 interrupt (DMA) 


26 


EDINT 


Enable DMA controller interrupt (DMA) 


31-27 


Reserved 


Read as 0 
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Figure 8-32. CPU/DMA Interrupt-Enable Register 
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(DMA) 
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(DMA) 
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(DMA) 
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(DMA) 


EXINT1 
(DMA) 


ERINTO 
(DMA) 


EXINTO 
(DMA) 


EINT3 
(DMA) 


EINT2 
(DMA) 


EINT1 
(DMA) 


EINTO 
(DMA) 












R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


15 


14 


13 


12 


11 


10 


9 


8 


7 


6 


5 


4 


3 


2 


1 


0 


XX 


XX 


XX 


XX 


XX 


EDINT 


ETINT1 


ETINTO 


ERINT1 


EXINT1 


ERINTO 


EXINTO 


EINT3 


EINT2 


EINT1 


EINTO 


(CPU) 


(CPU) 


(CPU) 


(CPU) 


(CPU) 


(CPU) 


(CPU) 


(CPU) 


(CPU) 


(CPU) 


(CPU) 












R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 


R/W 



Note: xx = Reserved bit, read as 0 
R = read, W = write 



8.3.5 DMA Memory Transfer Operation 

Each DMA memory transfer consists of two parts: 

□ Read data from the address specified by the DMA source register 

□ Write data that has been read to the address specified by the DMA desti- 
nation register 

A transfer is complete only when the read and write are complete. You can stop 
a transfer by setting the START bits to the desired value. When the DMA is re- 
started (START =1 1), it completes any pending transfer. 

At the end of a DMA read, the source address is modified as specified by the 
SRCINC and SRCDEC bits of the DMA global-control register. At the end of 
a DMA write, the destination address is modified as specified by the DSTINC 
and DSTDEC bits of the DMA global control register. At the end of every DMA 
write, the DMA transfer counter is decremented. 

DMA on-chip reads and writes (reads and writes from on-chip memory and pe- 
ripherals) are single-cycle. DMA off-chip reads are two cycles. The first cycle 
is the external read, and the second cycle loads the DMA register. The external 
read cycle is identical to a CPU read cycle. DMA off-chip writes are identical 
to CPU off-chip writes. If the DMA has been started and is transferring data 
over either external bus, you should not modify the bus-control register asso- 
ciated with that bus. If you must modify the bus-control register (see Chapter 
7), stop the DMA, make the modification, and then restart the DMA. Failure to 
do this may produce an unexpected zero-wait-state bus access. 
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Through the 24-bit source and destination registers, the DMA is capable of ac- 
cessing any memory-mapped location in the TMS320C3x memory map. 
Table 8-13, Table 8-14, and Table 8-15 show the number of cycles a DMA 
transfer requires, depending on whether the source and destination are on- 
chip memory and peripherals, the external port, or the I/O port. T represents 
the number of transfers to be performed, C r represents the number of wait- 
states for the source read , and represents the number of wait-states for the 
destination write. Each entry in the table represents the total cycles required 
to do the T transfers, assuming that there are no pipeline conflicts. 

Accompanying each table is a figure illustrating the timing of the DMA transfer. 
|R| and |W| represent single-cycle reads and writes, respectively. |R.R| and 
|W.W| represent multicycle reads and writes. |C r | and |C W | show the number 
of wait cycles for a read and write. 

Table 8-1 3. DMA Timing When Destination Is On-Chip 



Cycles (H1) 


1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 


Source On-Chip 
Destination On-Chip 


R R R :::::::::::: : 
|w| |w| |w| ::::::::::: : 


Source Primary Bus 
Destination On-Chip 


R.R.R: I | |R.R.R: I | | R.R.R:I | : : : : 
I °r | : : | C r | s : | C r | s - s : : 

: : : |w| : : : |w| : : : |w| : : : 


Source Expansion Bus 
Destination On-Chip 


R .R .R: l| | R.R.R : I | | R.R.R: I | : : : : 
: | Cr | . s | Cr | s s | Cr | : : : : : 

: : : |W| : : : |W| : : : |W| : : : 



Source 


Destination On-Chip 


On-Chip 


(1+1)7 


Primary Bus 


(2 + C r + 1)7 


Expansion Bus 


(2 + Cr + 1)7 



Legend: 




T 


Number of transfers 


C r 


Source-read wait states 


C w = 


Destination-write wait states 


|R| 


Single-cycle reads 


|W| = 


Single-cycle writes 


|R.R| - 


Multicycle reads 


|W.W| = 


Multicycle writes 


Ml - 


Internal register cycle 
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Table 8-1 4. DMA Timing When Destination Is a Primary Bus 



Cycles (H1) 


1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 


Source On-Chip 
Destination Primary Bus 


: : 1 Cw | : | Cw | : | Cw | : : : : : 


Source Primary Bus 
Destination Primary Bus 


R .R . R: 1 | : : : -R -R.R : 1 1 : : : : : : 
| Cr | . . . . . | Cr | . . . . . . 

: : : |W.W.W.w| : : : |w.W.W.w| : : 
: : : : : | C w 1 : : : : : I Cw 1 : : 


Source Expansion Bus 
Destination Primary Bus 


R . R .R : l| | R . R. R : || | R . R .R : 1 1 : : : : 
I Cr | ■ . | Cr | s s | Cr | - s : s s 

: : : |W.W.W.W| |W.W.W.W| |W.W.W.W| 



Source 


Destination Primary Bus 


On-Chip 


1 + (2 + Cw)T 


Primary 


(2+C r + 2+Cw)7" 


Bus 




Expansion 


(2 + C r + 2 + Cw) 


Bus 


+ (2 + 0^ + max(1 , C r - Cw + 




W-1) 



Legend: 




T 


Number of transfers 


C r 


Source-read wait states 


Cw = 


Destination-write wait states 


|R| 


Single-cycle reads 


|W| = 


Single-cycle writes 


|R.R| = 


Multicycle reads 


|W.W| = 


Multicycle writes 


Ml = 


Internal register cycle 
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Table 8-1 5. DMA Timing When Destination Is an Expansion Bus 



Cycles (H1) 


1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 


Source On-Chip 
Destination Expansion Bus 


R R : : R ::::::::::: 


Source Primary Bus 
Destination Expansion Bus 


R.R-R| i| |R.R .R : I | | R-R . R : 1 1 : : : : 

I °r I : : I °r I : : I °r | : : : : : 

Iwwwwl Iwwwwl Iwwwwl 
: : : : : l" C w | : : | Cw | : : | C w | 


Source Expansion Bus 
Destination Expansion Bus 


R.R.R.I | : : : | R . R . R : 1 1 : : : : : : 
I °r I : : : : : I c r | : : : : : : : 

: : : |W.W.W.W| : : : |W.W.W.W| : : 
: : : : : I C w | : : : : : | C w | : : 



Source 


Destination Expansion Bus 


On-Chip 


l + P + W 


Primary 


(2 + C r + 2+CW) 


Bus 


+ (2 + C^y + maxO.Cr- + 






Expansion 


(2 + ^ + 2+0^)7 


Bus 





Legend: 




T 


Number of transfers 


C r 


Source-read wait states 


C w = 


Destination-write wait states 


|R| = 


Single-cycle reads 


|W| = 


Single-cycle writes 


|R.R| = 


Multicycle reads 


|W.W| = 


Multicycle writes 


Ml = 


Internal register cycle 
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Table 8-1 6 shows the maximum DMA transfer rates, assuming that there are 
no wait states (C r = = 0). Table 8-17 shows the maximum DMA transfer 
rates, assuming there is one wait state for the read (C r = 1 ) and no wait states 
for the write (C^ = 0). Table 8-18 shows the maximum DMA transfer rates, 
assuming there is one wait state for the read (C r = 1 ) and one wait state for the 
write (C^= 1). 

In each table, the time for the complete transfer (the read and the write) is con- 
sidered. Since one bus access is required for the read and another for the 
write, internal bus transfer rates will be twice the DMA transfer rate. It is also 
assumed that no conflicts with the CPU exist. Rates are listed in Mwords/sec. 
A word is 32 bits (4 bytes). 



Table 8-1 6. Maximum DMA Transfer Rates When C r = 0^ = 0 





Destination 


Source 


Internal 


Primary 


Expansion 


Internal 


8.33 Mwords/sec 


8.33 Mwords/sec 


8.33 Mwords/sec 


Primary 


5.56 Mwords/sec 


4.17 Mwords/sec 


5.56 Mwords/sec 


Expansion 


5.56 Mwords/sec 


5.56 Mwords/sec 


4.17 Mwords/sec 


Table 8-1 7. Maximum DMA Transfer Rates When C r = 1, Cw = 0 




Destination 


Source 


Internal 


Primary 


Expansion 


Internal 


8.33 Mwords/sec 


8.33 Mwords/sec 


8.33 Mwords/sec 


Primary 


4.17 Mwords/sec 


3.33 Mwords/sec 


4.17 Mwords/sec 


Expansion 


4.17 Mwords/sec 


4.17 Mwords/sec 


3.33 Mwords/sec 


Table 8-1 8. Maximum DMA Transfer Rates When C r = 1, Cw= 1 




Destination 


Source 


Internal 


Primary 


Expansion 


Internal 


8.33 Mwords/sec 


5.56 Mwords/sec 


5.56 Mwords/sec 


Primary 


4.17 Mwords/sec 


2.78 Mwords/sec 


4.17 Mwords/sec 


Expansion 


4.17 Mwords/sec 


4.17 Mwords/sec 


2.78 Mwords/sec 
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8.3.6 Synchronization of DMA Channels 

You can synchronize a DMA channel with interrupts. Refer to Table 8-11 on 
page 8-46 for the relationship between the SYNC bits of the DMA global con- 
trol register and the synchronization performed. This section describes the fol- 
lowing four synchronization mechanisms: 

□ No synchronization (SYNC = 0 0) 

□ Source synchronization (SYNC = 0 1) 

□ Destination synchronization (SYNC = 1 0) 

□ Source and destination synchronization (SYNC = 11) 

No Synchronization 

When SYNC = 0 0, no synchronization is performed. The DMA performs reads 
and writes whenever there are no conflicts. All interrupts are ignored and 
therefore are considered to be globally disabled. However, no bits in the DMA 
interrupt-enable register are changed. Figure 8-33 shows the 
synchronization mechanism when SYNC = 0 0. 



Figure 8-33. No DMA Synchronization 




Disable DMA Interrupts 



| DMA Channel Performs a Read 



DMA Channel Performs a Write 



Source Synchronization 

When SYNC=0 1 , the DMA is synchronized to the source (see Figure 8-34). 
A read will not be performed until an interrupt is received by the DMA. Then 
all DMA interrupts are disabled globally. However, no bits in the DMA interrupt 
enable register are changed. 



8-54 



DMA Controller 



Figure 8-34. DMA Source Synchronization 



Start 




| Idle Until Enabled Interrupt Is Received | 
| Disable DMA Interrupts Globally | 



| DMA Channel Performs a Read | 
I Enable DMA Interrupts Globally I 

1 

| DMA Channel Performs a Write] 
| Go to Start | 

Destination Synchronization 

When SYNC = 1 0, the DMA is synchronized to the destination. First, all inter- 
rupts are ignored until the read is complete. Though the DMA interrupts are 
considered globally disabled, no bits in the DMA interrupt-enable register are 
changed. A write will not be performed until an interrupt is received by the 
DMA. Figure 8-35 shows the synchronization mechanism when SYNC = 1 0. 



Figure 8-35. DMA Destination Synchronization 



I DMA Channel Performs a Read 

— 22 i — 



| idle Until Enabled ^Interrupt Is Received 



I Disable DMA Interrupts Globally 

— — 



| DMA Channel Performs a Write 

™ 



| DMA Interrupts Are Enabled Globally" 
| Go to Start 1 

Source and Destination Synchronization 

When SYNC = 1 1 , the DMA is synchronized to both the source and destina- 
tion. A read is performed when an interrupt is received. A write is performed 
on the following interrupt. Source and destination synchronization when 
SYNC = 1 1 is shown in Figure 8-36. 
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Figure 8-36. DMA Source and Destination Synchronization 

I Start I 




| Idle Until Enabled Interrupt is Received | 
j Disable DMA Interrupts Globally | 
I DMA Channel Performs a Read I 

1 

| Enable DMA Interrupts Globally | 
| Idle Until Enabled Interrupt Is Received 1 
| Disable DMA Interrupts Globally | 
I DMA Channel Performs a Write I 

r 

I Enable DMA Interrupts Globally | 
| Go to Start | 



8.3.7 DMA Interrupts 

You can generate a DMA interrupt to the CPU whenever the transfer count 
reaches 0, indicating that the last transfer has taken place. The TCI NT bit in 
the DMA global control register determines whether the interrupt will be gener- 
ated. If TCINT = 1 , the DMA interrupt is generated. If TCI NT = 0, the DMA inter- 
rupt is not generated. If the DMA interrupt is generated, the EDINT bit, bit 10 
in the interrupt enable register, must also be set to enable the CPU to be inter- 
rupted by the DMA. 

A second bit in the DMA global control register, the TC bit, is also generally 
associated with the state of the TCINT bit and the interrupt operation. The TC 
bit determines whether transfers are terminated when the transfer counter be- 
comes 0 or whether they are allowed to continue. If TC = 1 , transfers are termi- 
nated when the transfer count becomes 0. If TC = 0, transfers are not termi- 
nated when the transfer count becomes 0. 

In general, if TCINT is 0, TC should also be cleared to 0. Otherwise, the DMA 
transfer will terminate, and the CPU will not be notified. If TCINT is 1 , TC should 
also be 1 in most cases. In this case, the CPU will be notified when the transfer 
completes, and the DMA will be halted and ready to start a new transfer. 
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8.3.8 DMA Initialization/Reconfiguration 

You can control the DMA through memory-mapped registers located on the 
dedicated peripheral bus. Following is the general procedure for initializing 
and/or reconfiguring the DMA: 

1 ) Halt the DMA by clearing the START bits of the DMA global-control regis- 
ter. You can do this by writing a 0 to the DMA global-control register. Note 
that the DMA is halted on RESET. 

2) Configure the DMA via the DMA global-control register (with START = 00) , 
as well as the DMA source, destination, and transfer-counter registers, if 
necessary. Refer to subsection 8.3.1 0 on page 8-58 for more information. 

3) Start the DMA by setting the START bits of the DMA global-control register 
as necessary. 

8.3.9 Hints for DMA Programming 

The following hints help you improve your DMA programming and avoid unex- 
pected results: 

□ Reset the DMA register before starting it. This clears any previously 
latched interrupt that may no longer exist. 

□ In the event of a CPU-DMA access conflict, the CPU always prevails. 
Carefully allocate the different sections of the program in memory for fast- 
er execution. If a CPU program access conflicts with a DMA access, enab- 
ling the cache helps if the program is located in external memory. DMA on- 
chip access happens during the H3 phase. Refer to Chapter 9 for details 
on CPU accesses. 

i i 
Note: Expansion and Peripheral Buses 

The expansion and peripheral buses cannot be accessed simultaneously 

because they are multiplexed into a common port (see Figure 2-1 on page 

2-3). This might increase CPU-DMA access conflicts, 
i i 

□ Ensure that each interrupt is received when you use interrupt synchroniza- 
tion; otherwise, the DMA will never complete the block transfer. 

□ Use read/write synchronization when reading from or writing to serial ports 
to guarantee data validity. 

The following are indications that the DMA has finished a set of transfers: 

□ The DINT bit in the IIF register is set to 1 (interrupt polling). This requires 
that the TCINT bit in the DMA control register be set first. This interrupt- 
polling method does not cause any additional CPU-DMA access conflict. 
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□ The transfer counter has a zero value. However, notice that the transfer 
counter is decremented after the DMA read operation finishes (not after 
the write operation). Nevertheless, a transfer counter with a 0 value can 
be used as an indication of a transfer completion. 

□ The STAT bits in the DMA channel control register are set to OO2. You can 
poll the DMA channel control register for this value. However, because the 
DMA registers are memory-mapped into the peripheral bus address 
space, this option can cause further CPU-DMA access conflicts. 



Example 8-5, Example 8-6, and Example 8-7 illustrate initialization proce- 
dures for the DMA. 

When linking the examples, you should allocate section memory addresses 
carefully to avoid CPU-DMA conflict. In the 'C3x, the CPU always prevails in 
cases of conflict. In the event of a CPU program-DMA data conflict, the enab- 
ling of the cache helps if the .text section is in external memory. For example, 
when linking the code in Example 8-5, Example 8-6, and Example 8-7, the 
.text section can be allocated into RAMO, .data into RAMI, and .bss into 
RAMI , where RAMO and RAMI correspond to on-chip RAM block 0 and block 
1 , respectively. 

In Example 8-5, the DMA initializes a 1 28-element array to 0. The DMA sends 
an interrupt to the CPU after the transfer is completed. This program assumes 
previous initialization of the CPU interrupt vector table (specifically the DMA- 
to-CPU interrupt). The program initializes the ST and IE registers for interrupt 
processing. 



8.3.10 DMA Programming Examples 



Example 8-5. Array Initialization With DMA 



* TITLE: ARRAY INITIALIZATION WITH DMA 



.GLOBAL START 
.DATA 



DMA .WORD 808000H 

RESET .WORD 0C40H 

CONTROL .WORD 0C43H 

SOURCE .WORD ZERO 

DESTIN .WORD _ARRAY 

COUNT .WORD 128 

ZERO .FLOAT 0.0 



DMA GLOBAL CONTROL REG ADDRESS 
DMA GLOBAL CONTROL REG RESET VALUE 
DMA GLOBAL CONTROL REG INITIALIZATION 
DATA SOURCE ADDRESS 
DATA DESTINATION ADDRESS 
NUMBER OF WORDS TO TRANSFER 

ARRAY INITIALIZATION VALUE 0.0 * 0x80000000 
DATA ARRAY LOCATED IN .BSS SECTION 



.BSS _ARRAY, 128 
.TEXT 
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START 



LDP 


DMA 


LDI 


@DMA, ARO 


LDI 


@RESET,R0 


STI 


R0,*AR0 


LDI 


@ SOURCE , RO 


STI 


R0,*+AR0(4) 


LDI 


@DESTIN,R0 


STI 


R0,*+AR0(6) 


LDI 


§ COUNT ,R0 


STI 


RO , *+AR0 ( 8 ) 


OR 


400H, IE 


OR 


2000H, ST 


LDI 


8 CONTROL, RO 


STI 


RO,*ARO 


BU 


$ 


.END 



; LOAD DATA PAGE POINTER 

; POINT TO DMA GLOBAL CONTROL REGISTER 

; RESET DMA 

; INITIALIZE DMA SOURCE ADDRESS REGISTER 

; INITIALIZE DMA DESTINATION ADDRESS REGISTER 

; INITIALIZE DMA TRANSFER COUNTER REGISTER 

; ENABLE INTERRUPT FROM DMA TO CPU 

; ENABLE CPU INTERRUPTS GLOBALLY 

; INITIALIZE DMA GLOBAL CONTROL REGISTER 

; START DMA TRANSFER 



Example 8-6 sets up the DMA to transfer data (1 28 words) from the serial port 
0 input register to an array buffer with serial port receive interrupt (RINTO). The 
DMA sends an interrupt to the CPU when the data transfer completes. 

Serial port 0 is initialized to receive 32-bit data words with an internally gener- 
ated receive-bit clock and a bit-transfer rate of 8H1 cycles/bit. 

This program assumes previous initialization of the CPU interrupt vector table 
(specifically the DMA-to-CPU interrupt). The serial port interrupt directly af- 
fects only the DMA; therefore, no CPU serial port interrupt vector setting is re- 
quired. 



Example 8-6. DMA Transfer With Serial-Port Receive Interrupt 

* TITLE DMA TRANSFER WITH SERIAL PORT RECEIVE INTERRUPT 
* 

•GLOBAL START 





.DATA 






DMA 


.WORD 


808000H 


; DMA GLOBAL CONTROL REG ADDRESS 


CONTROL 


.WORD 


0D43H 


; DMA GLOBAL CONTROL REG INITIALIZATION 


SOURCE 


.WORD 


80804CH 


? DATA SOURCE ADDRESS: SERIAL PORT INPUT REG 


DESTIN 


.WORD 


ARRAY 


; DATA DESTINATION ADDRESS 


COUNT 


.WORD 


128 


; NUMBER OF WORDS TO TRANSFER 


IEVAL 


.WORD 


00200400H 


; IE REGISTER VALUE 


RESET 1 


.WORD 


0D40H 


} DMA RESET 




.BSS 


__ARRAY, 128 


y DATA ARRAY LOCATED IN .BSS SECTION 








; THE UNDERSCORE USED IS JUST TO MAKE IT 








; ACCESSIBLE FROM C (OPTIONAL) 


SPORT 


.WORD 


808040H 


; SERIAL PORT GLOBAL CONTROL REG ADDRESS 


SGCCTRL 


.WORD 


0A300080H 


; SERIAL PORT GLOBAL CONTROL REG INITIALIZATION 


SRCTRL 


.WORD 


111H 


; SERIAL PORT RX PORT CONTROL REG INITIALIZATION 


STCTRL 


.WORD 


3C0H 


; SERIAL PORT TIMER CONTROL REG INITIALIZATION 


STPERIOD 


.WORD 


00020000H 


; SERIAL PORT TIMER PERIOD 


SPRESET 


.WORD 


01300080H 


} SERIAL PORT RESET 


RESET 


.WORD 


OH 


; SERIAL-PORT TIMER RESET 




.TEXT 






START 


LDP DMA 


; LOAD DATA PAGE POINTER 
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* DMA INITIALIZATION 

LDI @DMA,AR0 
LDI § SPORT ,AR1 
LDI @RESET,R0 
STI R0,*+AR1(4) 
LDI @ RESET 1,R0 
STI RO , *AR0 
LDI @SPRESET,RO 
STI RO / *AR1 
LDI @ SOURCE, RO 
STI R0,*+ARO(4) 
LDI @DESTIN,R0 
STI R0,*+AR0(6) 
LDI 8 COUNT, RO 
STI R0,*+AR0(8) 
OR §IEVAL,IE 
OR 2000H,ST 
LDI @CONTROL,R0 
STI R0,*AR0 

* SERIAL PORT INITIALIZATION 

LDI §SRCTRL,R0 
STI R0,*+AR1(3) 
LDI §STPERIOD,R0 
STI R0,*+AR1(6) 
LDI §STCTRL,R0 
STI R0,*+AR1(4) 
LDI §SGCCTRL,RO 
STI R0,*AR1 
BU $ 
.END 



; POINT TO DMA GLOBAL CONTROL REGISTER 

RESET SPORT TIMER 
RESET DMA 
RESET SPORT 

INITIALIZE DMA SOURCE ADDRESS REGISTER 

INITIALIZE DMA DESTINATION ADDRESS REGISTER 

INITIALIZE DMA TRANSFER COUNTER REGISTER 

ENABLE INTERRUPTS 
ENABLE CPU INTERRUPTS GLOBALLY 
INITIALIZE DMA GLOBAL CONTROL REGISTER 
START DMA TRANSFER 

SERIAL-PORT RECEIVE CONTROL REG INITIALIZATION 
SERIAL-PORT TIMER PERIOD INITIALIZATION 
SERIAL-PORT TIMER CONTROL REG INITIALIZATION 
SERIAL-PORT GLOBAL CONTROL REG INITIALIZATION 



Example 8-7 sets up the DMA to transfer data (1 28 words) from an array buff- 
er to the serial port 0 output register with serial port transmit interrupt XINTO. 
The DMA sends an interrupt to the CPU when the data transfer completes. 

Serial port 0 is initialized to transmit 32-bit data words with an internally gener- 
ated frame sync and a bit-transfer rate of 8H1 cycles/bit. The receive-bit clock 
is internally generated and equal in frequency to one-half of the 'C3x H1 fre- 
quency. 

This program assumes previous initialization of the CPU interrupt vector table 
(specifically the DMA-to-CPU interrupt). The serial port interrupt directly af- 
fects only the DMA; therefore, no CPU serial port interrupt vector setting is re- 
quired. 

i 1 

Note: Serial Port Transmit Synchronization 

The DMA uses serial port transmit interrupt XINTO to synchronize transfers. 
Because the XINTO is generated when the transmit buffer has written the last 
bit of data to the shifter, an initial CPU write to the serial port is required to 
trigger XINTO to enable the first DMA transfer. 
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Example 8-7. DMA Transfer With Serial-Port Transmit Interrupt 



* TITLE: DMA TRANSFER WITH 

* .GLOBAL START 
.DATA 

DMA .WORD 808000H 

CONTROL .WORD 0E13H 
SOURCE . WORD ( _ARRAY+ 1 ) 
DESTIN .WORD 80804CH 
COUNT .WORD 127 

IEVAL .WORD 00100400H 

.BSS _ARRAY,128 



SERIAL PORT TRANSMIT INTERRUPT 



DMA GLOBAL CONTROL REG ADDRESS 

DMA GLOBAL CONTROL REG INITIALIZATION 

DATA SOURCE ADDRESS 

DATA DESTIN ADDRESS: SERIAL-PORT OUTPUT REG 
NUMBER OF WORDS TO TRANSFER =(MSG LENGHT-1 ) 
IE REGISTER VALUE 

DATA ARRAY LOCATED IN .BSS SECTION 
THE UNDERSCORE USED IS JUST TO MAKE IT 
ACCESSIBLE FROM C (OPTIONAL) 



RESET1 


.WORD 


0E10H 


; DMA RESET 




SPORT 


.WORD 


808040H 


; SERIAL-PORT 


GLOBAL CONTROL REG ADDRESS 


SGCCTRL 


.WORD 


04880044H 


; SERIAL-PORT 


GLOBAL CONTROL REG INITIALIZATION 


SXCTRL 


.WORD 


111H 


; SERIAL-PORT 


TX PORT CONTROL REG INITIALIZATION 


STCTRL 


.WORD 


OOFH 


; SERIAL-PORT 


TIMER CONTROL REG INITIALIZATION 


STPERIOD 


.WORD 


00000002H 


; SERIAL-PORT 


TIMER PERIOD 


SPRESET 


.WORD 


00880044H 


; SERIAL-PORT 


RESET 


RESET 


.WORD 


OH 


; SERIAL-PORT 


TIMER RESET 




.TEXT 








START 


LDP DMA 




; LOAD DATA PAGE POINTER 



* DMA INITIALIZATION 

LDI @DMA, ARO 
LDI @ SPORT, ARl 
LDI @ RESET, RO 
STIR0,*+AR1(4) 
STIRO, *AR0 
STI RO, *AR1 
LDI @ SOURCE , RO 
STIR0,*+AR0(4) 
LDI @DESTIN,R0 
STIR0,*+AR0(6) 
LDI 6 COUNT, RO 
STIR0,*+AR0(8) 
OR § IEVAL , IE 
OR 2000H,ST 
LDI @ CONTROL, RO 
STIR0,*AR0 



; POINT TO DMA GLOBAL CONTROL REGISTER 

? RESET SPORT TIMER 

; RESET DMA 

; RESET SPORT 

; INITIALIZE DMA SOURCE ADDRESS REGISTER 

; INITIALIZE DMA DESTINATION ADDRESS REGISTER 

; INITIALIZE DMA TRANSFER COUNTER REGISTER 

; ENABLE INTERRUPT FROM DMA TO CPU 

; ENABLE CPU INTERRUPTS GLOBALLY 

; INITIALIZE DMA GLOBAL CONTROL REGISTER 

? START DMA TRANSFER 
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* SERIAL PORT INITIALIZATION 



LDI iSXCTRL.RO 
STI R0, *+ARl(2) 
LDI @STPERIOD,R0 
STIR0,*+AR1(6) 
LDI §STCTRL,R0 
STIR0 r *+ARl(4) 
LDIgSGCCTRL,RO 
STI R0,*AR1 



SERIAL-PORT 



SERIAL-PORT 



SERIAL-PORT 



SERIAL-PORT 



GLOBAL CONTROL REG INITIALIZATION 



TIMER PERIOD INITIALIZATION 



TIMER CONTROL REG INITIALIZATION 



TX CONTROL REG INITIALIZATION 



* CPU WRITES THE FIRST WORD (TRIGGERING EVENT > XINT IS GENERATED) 



LDI @ SOURCE, ARO 
LDI *-AR0(l),R0 
STI R0,*+AR1(8) 
BU $ 



.END 



Other examples are as follows: 

□ Transfer a 256-word block of data from off-chip memory to on-chip 
memory and generate an interrupt on completion. The order of memory 
is to be maintained. 



CPU/DMA interrupt enable (IE): 00000400h 

□ Transfer a 128-word block of data from on-chip memory to off-chip 
memory and generate an interrupt on completion. The order of memory 
is to be inverted; that is, the highest addressed member of the block is to 
become the lowest addressed member. 

DMA source address: 809800h 
DMA destination address: 800000h 
DMA transfer counter: 00000080h 
DMA global control: 00000C93h 
CPU/DMA interrupt enable (IE): 00000400h 

□ Transfer a 200-word block of data from the serial-port-0 receive register 
to on-chip memory and generate an interrupt on completion. The transfer 
is to be synchronized with the serial-port-0 receive interrupt. 

DMA source address: 80804Ch 
DMA destination address: 809C00h 
DMA transfer counter: 000000C8h 
DMA global control: 00000D43h 
CPU/DMA interrupt enable (IE): 00200400h 



DMA source address: 
DMA destination address: 
DMA transfer counter: 
DMA global control: 



800000h 
809800h 
000001 OOh 
00000C53h 
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□ Transfer a 200-word block of data from off-chip memory to the serial-port-0 
transmit register and generate an interrupt on completion. The transfer is 
to be synchronized with the serial-port-0 transmit interrupt. 

DMA source address: 809C00h 
DMA destination address: 808048h 
DMA transfer counter: 000000C8h 
DMA global control: 00000E1 3h 

CPU/DMA interrupt enable (IE): 00400400h 

□ Transfer data continuously between the serial-port-0 receive register and 
the serial-port-0 transmit register to create a digital loop back. The transfer 
is to be synchronized with the serial-port-0 receive and transmit interrupts. 

DMA source address: 80804Ch 
DMA destination address: 808048h 
DMA transfer counter: OOOOOOOOh 
DMA global control: 00000303b 
CPU/DMA interrupt enable (IE): 00300000h 
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Pipeline Operation 



Two characteristics of the TMS320C3x that contribute to its high performance 
are: 

□ Pipelining, and 

□ Concurrent I/O and CPU operation. 

Five functional units control TMS320C3x operation: 

□ Fetch 

□ Decode 

□ Read 

□ Execute 

□ Direct memory access (DMA) 

Pipelining is the overlapping or parallel operations of the fetch, decode, read, 
and execute levels of a basic instruction. 

By performing input/output operations, the DMA controller reduces the need 
for the CPU to do so, thereby decreasing pipeline interference and enhancing 
the CPU's computational throughput. 

Major topics discussed in this chapter are as follows: 



Topic Page 

9.1 Pipeline Structure 9-2 

9.2 Pipeline Conflicts 9-4 

9.3 Resolving Register Conflicts 9*18 

9.4 Resolving Memory Conflicts 9-21 

9.5 Clocking of Memory Accesses 9*23 
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9.1 Pipeline Structure 

The five major units of the TMS320C3x pipeline structure and their functions 
are as follows: 

□ Fetch Unit (F) 

This unit fetches the instruction words from memory and updates the pro- 
gram counter (PC). 

□ Decode Unit (D) 

This unit decodes the instruction word and performs address generation. 
The unit also controls any modifications to the auxiliary registers and the 
stack pointer. 

□ Read Unit (R) 

This unit, if required, reads the operands from memory. 

□ Execute Unit (E) 

This unit, if required, reads the operands from the register file, performs 
any necessary operation, and writes results to the register file. If required, 
the unit writes results of previous operations to memory. 

□ DMA Channel (DMA) 

The DMA channel reads and writes to memory. 

A basic instruction has four levels: 

□ Fetch 

□ Decode 

□ Read 

□ Execute 

Figure 9-1 illustrates these four levels of the pipeline structure. The levels are 
indexed according to instruction and execution cycle. The perfect overlap in 
the pipeline, where all four units operate in parallel, occurs at cycle (at?). Those 
levels about to be executed are at m + 1 , and those just executed are at m - 1 . 
The TMS320C3x pipeline control allows a high-speed execution rate of one 
execution per cycle. It also manages pipeline conflicts so that they are trans- 
parent to the user. You do not need to take any special precautions to guaran- 
tee correct operation. 
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Figure 9-1. TMS320C3x Pipeline Structure 

CYCLE | F | D | R | E | 



m-3 w 

m -2 x w 

m-1 y x w 

m z y x w Perfect overlap 

m + 1 - z y x 

m + 2 - - z y 

m + 3 - - - z 



D = Decode, E = Execute, F = Fetch, R = Read; W, X, Y,Z= Instruction Representations 

Priorities from highest to lowest have been assigned to each of the functional 
units as follows: 

1) Execute (highest) 

2) Read 

3) Decode 

4) Fetch 

5) DMA (lowest) 

When the processing of an instruction is ready to pass to the next higher pipe- 
line level, but that level is not ready to accept a new input, a pipeline conflict 
occurs. In this case, the lower-priority unit waits until the higher-priority unit 
completes its currently executing function. 

Despite the DMA controller's low priority, you can minimize or even eliminate 
conflicts with the CPU through suitable data structuring because the DMA con- 
troller has its own data and address buses. 
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9.2 Pipeline Conflicts 

The pipeline conflicts of the TMS320C3x can be grouped into the following 
categories: 

□ Branch Conflicts 

Branch conflicts involve most of those instructions or operations that read 
and/or modify the PC. 

□ Register Conflicts 

Register conflicts involve delays that can occur when reading from or writ- 
ing to registers that are used for address generation. 

□ Memory Conflicts 

Memory conflicts occur when the internal units of the TMS320C3x com- 
pete for memory resources. 

Each of these three categories is discussed in the following sections. Exam- 
ples are included. Note that in these examples, when data is refetched or an 
operation is repeated, the symbol representing the stage of the pipeline is ap- 
pended with a number. For example, if a fetch is performed again, the instruc- 
tion mnemonic is repeated. Whe n an a ccess is detained for multiple cycles be- 
cause of not ready, the symbols RDY and RDY are used to indicate not ready 
and ready, respectively. 



9.2.1 Branch Conflicts 

The first class of pipeline conflicts occurs with standard (nondelayed) 
branches, that is, BR, Bcond, DBcond, CALL, IDLE, RPTB, RPTS, RETIconof, 
RETScond, interrupts, and reset. Conflicts arise with these instructions and 
operations because during their execution, the pipeline is used only for the 
completion of the operation; other information fetched into the pipeline is dis- 
carded or refetched, or the pipeline is inactive. This is referred to as flushing 
the pipeline. Flushing the pipeline is necessary in these cases to guarantee 
that portions of succeeding instructions do not inadvertently get partially ex- 
ecuted. TRAPcond and CALLconcf are classified differently from the other 
types of branches and are considered later. 

Example 9-1 shows the code and pipeline operation for a standard branch. 

I ! 1 

Note: Dummy Fetch 

One dummy fetch (an MPYF instruction) is performed, which affects the 
cache. After the branch address is available, a new fetch (an OR instruction) 
is performed. 
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Example 9-1. Standard Branch 



BR 

MPYF 
ADD 
SUBF 
AND 



THREE 



Unconditional branch 
Not executed 
Not executed 
Not executed 
Not executed 



THREE OR 
STI 



; Fetched after BR is fetched 







PIPELINE OPERATION 




PC 


i ■= 


» 


i « i 


E 


n 


BR 








n + 1 


MPYF 


BR 






11^1 


(nop) 


(nop) 


BR 




n + 1 


(nop) 


(nop) 


(nop) 


BR 


THREE 


OR 


(nop) 


(nop) 


(nop) 




STI 

/ 


OR 

/ 


(nop) 


(nop) 



THREE -* PC 



Fetch held for 
new PC value 



D = Decode, E = Execute, F » Fetch, R - Read, PC * Program Counter 



RPTS and RPTB both flush the pipeline, allowing the RS, RE, and RC registers 
to be loaded at the proper time relative to the flow of the pipeline. If these regis- 
ters are loaded without the use of RPTS or RPTB, no flushing of the pipeline 
occurs. If you are not using any of the repeat modes, then you can use RS, RE, 
and RC as general-purpose 32-bit registers and not cause any pipeline con- 
flicts. In cases such as the nesting of RPTB due to nested interrupts, it might 
be necessary to load and store these registers directly while using the repeat 
modes. Since up to four instructions can be fetched before entering the repeat 
mode, you should follow loads by a branch to flush the pipeline. If the RC is 
changing when an instruction is loading it, the direct load takes priority over 
the modification made by the repeat mode logic. 
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Delayed branches are implemented to guarantee the fetching of the next three 
instructions. The delayed branches include BRD, BcondD, and DBcondD. 
Example 9-2 shows the code and pipeline operation for a delayed branch. 



Example 9-2. Delayed Branch 



BRD THREE 

MPYF 

ADD 

SUBF 

AND 



Unconditional delayed branch 

Executed 

Executed 

Executed 

Not executed 



THREE MPYF 



; Fetched after SUBF is fetched 



PC 

n 

n + 1 
n + 2 
n + 3 
THREE 



PIPELINE OPERATION 
| F | D | R | E | 



BRD - - - 

MPYF BRD - - 

ADDF MPYF BRD - 

SUBF ADDF MPYF BRD 

MPYF SUBF ADDF MPYF 



No execute delay 



THREE -* PC 

D = Decode, E = Execute, F « Fetch, R ■ Read, PC » Program Counter 
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9.2.2 Register Conflicts 

Register conflicts involve reading or writing registers used for addressing. 
These conflicts occur when the pertinent register is not ready to be used. Some 
conditions under which you can avoid register conflicts are discussed in Sec- 
tion 9.3 on page 9-18. 

The registers comprise the following three functional groups: 

□ Group 1 

This group includes auxiliary registers (AR0-AR7), index registers (IRO, 
IR1), and block size register (BK). 

□ Group 2 

This group includes the data page pointer (DP). 

□ Group 3 

This group includes the system stack pointer (SP). 

If an instruction writes to one of these three groups, the decode unit cannot use 
any register within that particular group until the write is complete, that is, in- 
struction execution is completed. In Example 9-3, an auxiliary register is 
loaded, and a different auxiliary register is used on the next instruction. Since 
the decode stage needs the result of the write to the auxiliary register, the de- 
code of this second instruction is delayed two cycles. Every time the decode 
is delayed, a refetch of the program word is performed; that is, the ADDF is 
fetched three times. Since these are actual refetches, they can cause not only 
conflicts with the DMA controller but also cache hits and misses. 
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Example 9-3. Write to an AR Followed by an AR for Address Generation 

LDI 7 , ARl ; 7 — * ARl 

NEXT MPYF *AR2,R0 ; Decode delayed 2 cycles 
ADDF 
FLOAT 



PIPELINE OPERATION 



PC 


l F 


I D 




I E 


n 


LDI 








n + 1 


MPYF 


LDI 






n + 2 


ADDF 


MPYF 


LDI 




n + 2 


ADDF 


MPYF 


(nop) 


LDI 7, ARl 


n + 2 


ADDF 


MPYF 


(nop) 


(nop) 


n + 3 


FLOAT 


ADDF 


MPYF 


(nop) 



D = Decode, E = Execute, F = Fetch, R = Read, PC = Program Counter 

The case for reads of these groups is similar to the case for writes. If an 
instruction must read a member of one of these groups, the use of that particu- 
lar group by the decode for the following instruction is delayed until the read 
is complete. The registers are read at the start of the execute cycle and there- 
fore require only a one-cycle delay of the following decode. For four registers 
(IRO, IR1, BK, or DP), there is no delay. For all other registers, including the 
SP, the delay occurs. 

In Example 9-4, two auxiliary registers are added together, with the result go- 
ing to an extended-precision register. The next instruction uses a different aux- 
iliary register as an address register. 
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Example 9-4. A Read ofARs Followed byARs for Address Generation 

ADDI AR0,AR1,R1 ; ARO + ARl Rl 

NEXT MPYF *++AR2,R0 ; Decode delayed one cycle 
ADDF 
FLOAT 



PIPELINE OPERATION 



PC 


I F I 


I ° 


I R 


I E I 


n 


ADDI 








n + 1 


MPYF 


ADDI 






n + 2 


ADDF 


MPYF 


ADDI 




n + 2 


ADDF 


MPYF 


(nop) 


ADDI ARO, ARl, RO 


n + 3 


FLOAT 


ADDF 


MPYF 


(nop) 



D = Decode, E = Execute, F= Fetch, R = Read, PC = Program Counter 

Loop counter auxiliary registers for the decrement and branch (DBR)) instruc- 
tion are regarded in the same way as they are for addressing. Therefore, the 
operation shown in Example 9-3 and Example 9-4 can also occur for this in- 
struction. 
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9.2.3 Memory Conflicts 

Memory conflicts can occur when the memory bandwidth of a physical 
memory space is exceeded. For example, RAM blocks 0 and 1 and the ROM 
block can support only two accesses per cycle. The external interface can sup- 
port only one access per cycle. Section 9.4 on page 9-21 contains some condi- 
tions under which you can avoid memory conflicts. 

Memory pipeline conflicts consist of the following four types: 

□ Program wait 

A program fetch is prevented from beginning. 

□ Program fetch incomplete 

A program fetch has begun but is not yet complete. 

□ Execute only 

An instruction sequence requires three CPU data accesses in a single 
cycle. 

□ Hold everything 

A primary or expansion bus operation must complete before another one 
can proceed. 

These four types of memory conflicts are illustrated in examples and dis- 
cussed in the paragraphs that follow. 

Program Wait 

Two conditions can prevent the program fetch from beginning: 

□ The start of a CPU data access when: 

■ Two CPU data accesses are made to an internal RAM or ROM block, 
and a program fetch from the same block is necessary. 

■ One of the external ports is starting a CPU data access, and a program 
fetch from the same port is necessary. 

□ A multicycle CPU data access or DMA data access over the external bus 
is needed. 
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Example 9-5 illustrates a program wait until a CPU data access completes. 
In this case, *AR0 and *AR1 are both pointing to data in RAM block 0, and the 
MPYF instruction will be fetched from RAM block 0. This results in the conflict 
shown in Example 9-5. Since no more than two accesses can be made to 
RAM block 0 in a single cycle, the program fetch cannot begin and must wait 
until the CPU data accesses are complete. 

Example 9-5. Program Wait Until CPU Data Access Completes 

ADDF3 *AR0,*AR1,R0 

FIX 

MPYF 

ADDF3 

NEGB 

PIPELINE OPERATION 



PC 


I ' 


I D 




I E 


n 


ADDF3 








n + 1 


FIX 


ADDF3 






n + 2 


(WAIT) 


FIX 


ADDF3 




n + 2 


MPYF 


(nop) 


FIX 


ADDF3 


n + 3 


ADDF3 


MPYF 


(nop) 


FIX 


n + 4 


NEGB 


ADDF3 


MPYF 


(nop) 



D = Decode, E = Execute, F » Fetch, R ■ Read, PC = Program Counter 

Example 9-6 shows a program wait due to a multicycle data-data access or 
a multicycle DMA access. The ADDF, MPYF, and SUBF are fetched from a 
portion of memory other than the external port that the DMA requires. The 
DMA begins a multicycle access. The program fetch corresponding to the 
CALL is made to the same external port that the DMA is using. 

Either of two cases may produce this situation: 

□ One of the following two memory boundaries is crossed: 

■ From 7F FFFFh to 80 OOOOh, or 

■ From80 9FFFhto80A000h. 

□ Code that has been cached is executed, and the instruction prior to the 
ADDF is one of the following (conditional or unconditional): 

■ a delayed branch instruction, or 

■ a delayed decrement and branch instruction. 
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Even though the DMA has the lowest priority, multicycle access cannot be 
aborted. The program fetch must therefore wait until the DMA access com- 
pletes. 

Example 9-6. Program Wait Due to Multicycle Access 



PIPELINE OPERATION 



PC 




I D 






n 


ADDF 








n + 1 


MPYF 


ADDF 






n + 2 


SUBF 


MPYF 


ADDF 




n + 3 


(WAIT) 


SUBF 


MPYF 


ADDF 


n + 3 


CALL 


(nop) 


SUBF 


MPYF 


n + 4 




CALL 


(nop) 


SUBF 



D = Decode, E = Execute, F ■ Fetch, R « Read, PC = Program Counter 

Program Fetch Incomplete 

A program fetch incomplete occurs when a program fetch requires more than 
one cycle to complete due to wait states. In Example 9-7, the MPYF and 
ADDF are fetched from memory that supports single-cycle accesses. The 
SUBF is fetched from memory, which requires one wait state. One example 
that demonstrates this conflict is a fetch across a bank boundary on the 
primary port. See Section 7.4 on page 7-30. 

Example 9-7. Multicycle Program Memory Fetches 

PIPELINE OPERATION 



PC 




I ■> 


I R 




n 


MPYF 








n + 1 


ADDF 


MPYF 






n + 2 RDY 


SUBF 


ADDF 


MPYF 




n + 2 RDY 


SUBF 


(nop) 


ADDF 


MPYF 


n + 3 


ADD I 


SUBF 


(nop) 


ADDF 



D a Decode, E = Execute, F = Fetch, R « Read, PC = Program Counter 
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Execute Only 

The execute only type of memory pipeline conflict occurs when performing an 
interlocked load or when a sequence of instructions requires three CPU data 
accesses in a single cycle. There are three cases in which this occurs: 

□ An instruction performs a store and is followed by an instruction that does 
two memory reads. 

□ An instruction performs two stores and is followed by an instruction that 
performs at least one memory read. 

□ An interlocked load (LDII or LDFI) instruction is performed, and XF1 = 1 . 

The first case is shown in Example 9-8. Since this sequence requires three 
data memory accesses and only two are available, only the execute phase of 
the pipeline is allowed to proceed. The dual reads required by the LDF || LDF 
are delayed one cycle. Note that a refetch of the next instruction can occur. 

Example 9-8. Single Store Followed by Two Reads 

STF R0,*AR1 ; RO - 
LDF *AR2,R1 ; *AR2 
|| LDF *AR3,R2 ; *AR3 

PIPELINE OPERATION 
PC | F | D | R | E 

n stf - - - 

n + 1 LDF 1 1 LDF STF - 

n + 2 W LDF 1 1 LDF STF 

n + 3 X W LDF 1 1 LDF STF 

n + 4 X W LDF || LDF (nop) 

n + 4 Y X W LDF || LDF *AR2,R1 and *AR3,R2 

D= Decode, E = Execute, F= Fetch, /?= Read, PC = Program Counter, W,X,Y- Instruction Representations 



- *AR1 

-*R1 in parallel with 
-*R2 
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Example 9-9 shows a parallel store followed by a single load or read. Since 
the two parallel stores are required, the next CPU data memory read must wait 
a cycle before beginning. One program memory refetch can occur. 

Example 9-9. Parallel Store Followed by Single Read 

STF R0,*AR0 ; RO -* *AR0 in parallel with 

|| STF R2,*AR1 R2 -* *ARl 

ADDF 8SUM,R1 ; Rl + §SUM -* Rl 

IACK 

ASH 







PIPELINE OPERATION 




PC 


I ' 




I » I 


E 


n 


STF || STF 








n + 1 


ADDF 


STF || STF 






n + 2 


IACK 


ADDF 


STF || STF 




n + 3 


ASH 


IACK 


ADDF 


STF || STF 


n + 4 


ASH 


IACK 


ADDF 


(nop) 


n + 4 




ASH 


IACK 


ADDF 



D = Decode, E = Execute, F = Fetch, R = Read, PC » Program Counter 
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The final case involves an interlocked load (LDII or LDFI) instruction and XF1 
= 1 . Since the interlocked loads use the XF1 pin as an acknowledge that the 
read can complete, the loads might need to extend the read cycle, as shown 
in Example 9-10. Note that a program refetch can occur. 

Example 9-10. Interlocked Load 







NOT 
LDII 
ADDI 
CMPI 


R1,R0 
300h,AR2 
* AR2 , R2 
R0,R2 






PIPELINE OPERATION 


PC 


I r 


I D I 


R | E 


n 


NOT 






n + 1 


LDII 


NOT 




n + 2 


ADDI 


LDII 


NOT 


n + 3 


CMPI 


ADDI 


LDII NOT 


n + 3 




CMPI 


ADDI LDII 


n + 4 




CMPI 


ADDI LDII 



D = Decode, E = Execute, F = Fetch, R = Read, PC * Program Counter 
Hold Everything 

Three situations result in hold-everything memory pipeline conflicts: 

□ A CPU data load or store cannot be performed because an external port is 
busy. 

□ An external load takes more than one cycle. 

□ Conditional calls and traps are processed. 
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The first type of hold everything conflict occurs when one of the external ports 
is busy due to an access that has started but is not complete. In Example 9-1 1 , 
the first store is a two-cycle store. The CPU writes the data to an external port. 
The port control then takes two cycles to complete the data-data write. The 
LDF is a read over the same external port. Since the store is not complete, the 
CPU continues to attempt LDF until the port is available. 

Example 9-11. Busy External Port 

STF R0,§DMA1 
LDF e DMA2.R0 

PIPELINE OPERATION 



PC 


I F 


I D 


I " 




n 


STF 








n + 1 


LDF 


STF 






n + 2 


W 


LDF 


STF 




n + 2 


W 


LDF 


(nop) 


STF 


n + 2 


W 


LDF 


(nop) 


(nop) 


n + 3 


X 


W 


LDF 


(nop) 


n + 4 


Y 


X 


W 


LDF 



D= Decode, E = Execute, F= Fetch, f?= Read, PC= Program Counter, W,X, V* Instruction Representations 
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The second type of hold everything conflict involves multicycle data reads. The 
read has begun and continues until completed. In Example 9-12, the LDF is 
performed from an external memory that requires several cycles to complete. 

Example 9-12. Multicycle Data Reads 

LDF @DMA,R0 
PIPELINE OPERATION 



PC 


I F I 


D 


I R 


I E 


n 


LDF 








n + 1 


I 


LDF 






n + 2 


J 


I 


LDF 




n + 3 


K , (dummy) 


I 


LDF 




n + 3 


K 2 


J 


I 


LDF 



D = Decode, E = Execute, F= Fetch, R ■ Read, PC » Program Counter, /, J, /<= Instruction Representations 

The final type of hold everything conflict involves conditional calls and traps, 
which are different from the other branch instructions. Whereas the other 
branch instructions are conditional loads, the conditional calls and traps are 
conditional stores, which require one cycle more than a conditional branch 
(see Example 9-1 3). The added cycle is used to push the return address after 
the call condition is evaluated. 

Example 9-13. Conditional Calls and Traps 

PIPELINE OPERATION 



PC | 




I D I 


I " 


I e 


n9 


CALLcoiid 








n + 1 


I 


CALLcond 






n + 1 


(nop) 


(nop) 


CALLcond 




n + 1 


(nop) 


(nop) 


(nop) 


CALLcond 


n + 1 


(nop) 


(nop) 


(nop) 


CALLcond 


n + 2/CALLaddr 


I 


(nop) 


(nop) 


(nop) 



D = Decode, E » Execute, F = Fetch, f? = Read, PC « Program Counter, /, - Instruction Representation 



Pipeline Operation 9-17 



Resolving Register Conflicts 



9.3 Resolving Register Conflicts 

If the auxiliary registers (AR7-AR0), the index registers (IR1 -IRO), data page 
pointer (DP), or stack pointer (SP) are accessed for any reason other than ad- 
dress generation, pipeline conflicts associated with the next memory access 
can occur. The pipeline conflicts and delays are presented in subsection 9.2 
on page 9-4. 

Example 9-14, Example 9-15, and Example 9-16 demonstrate either some 
common uses of these registers that do not produce a conflict or ways that you 
can avoid the conflict. 

Example 9-14. Address Generation Update of an AR Followed by an AR for Address 
Generation 







LDF 


7.0,R0 ; 7.0 






MPYF 


*++AR0(IRl),R0 






ADDF 


*AR2 ,R0 






FIX 








MPYF 








ADDF 








PIPELINE OPERATION 


PC 




I ■> 


| R | E 


n 


LDF 






n + 1 


MPYF 


LDF 




n + 2 


ADDF 


MPYF 


LDF - 


n + 3 


FIX 


ADDF 


MPYF LDF 


n + 4 


MPYF 


FIX 


ADDF MPYF 


n + 5 


ADDF 


MPYF 


FIX ADDF 



D= Decode, E= Execute, F= Fetch, fl= Read, PC = Program Counter, W,X, Y,Z= Instruction Representations 
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Example 9-15. Write to an AR Followed by an AR for Address Generation Without a 
Pipeline Conflict 

LDI § TABLE, AR2 

MPYF § VALUE, Rl 

ADDF R2,R1 

MPYF *AR2++,R1 
SUBF 
STF 



PIPELINE OPERATION 

PC | F | D | R | E | 

n ldi 

n + 1 MPYF LDI - 

n + 2 ADDF MPYF LDI - 

n + 3 MPYF ADDF MPYF LDI 7, 

AR2 

n + 4 SUBF MPYF ADDF MPYF 

n + 5 STF SUBF MPYF ADDF 



D = Decode, E « Execute, F = Fetch, R » Read, PC » Program Counter 
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Example 9-16. Write to DP Followed by a Direct Memory Read Without a Pipeline Conflict 



LDP TABLE_ADDR 

POP RO 

LDF *-AR3(2),Rl 

LDI 6 TABLE_ADDR , ARO 

PUSHF R6 

PUSH R4 



PIPELINE OPERATION 
PC | F | D | R | E | 

n ldp - 

n + 1 pop ldp - - 

n + 2 LDF POP LDP - 

n + 3 LDI LDF POP LDP 

n + 4 PUSHF LDI LDF POP 

n + 5 PUSH PUSHF LDI LDF 

D * Decode, E = Execute, F « Fetch, R = Read, PC ■ Program Counter 
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9.4 Resolving Memory Conflicts 

If program fetches and data accesses are performed in such a manner that the 
resources being used cannot provide the necessary bandwidth, the program 
fetch is delayed until the data access is complete. Certain configurations of 
program fetch and data accesses yield conditions under which the 
TMS320C3x can achieve maximum throughput. 

Table 9-1 shows how many accesses can be performed from the different 
memory spaces when it is necessary to do a program fetch and a single data 
access and still achieve maximum performance (one cycle). As shown in 
Table 9-1 , four cases achieve one-cycle maximization. 

Table 9-1. One Program Fetch and One Data Access for Maximum Performance 







Accesses From 


Expansion Bust 




Primary Bus 


Dual-Access 


Or Peripheral 


Case# 


Accesses 


Internal Memory 


Accesses 


1 


1 


1 




2 


1 




1 






2 from any 




3 




combination 








of internal memory 




4 




1 


1 



t The expansion bus is available only on the TMS320C30. 
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Table 9-2 shows how many accesses can be performed from the different 
memory spaces when it is necessary to do a program fetch and two data ac- 
cesses and still achieve maximum performance (one cycle). Six conditions 
achieve this maximization. 



Table 9-2. One Program Fetch and Two Data Accesses for Maximum Performance 



Accesses From Expansiont Or 
Primary Bus Dual-Access Peripheral Bus 

Case # Accesses Internal Memory Accesses 



1 


1 


2 from any 
combination 
of internal memory 




2t 


1 Program 


1 Data 


1 Data 


3t 


1 Data 


1 Data 


1 Program 


4 




2 from same internal 
memory block and 
1 from a different 
internal memory 
block 




5 




3 from different 
internal memory 
blocks 




6 




2 from any 
combination 
of internal memory 


1 



t The expansion bus is available only on the TMS320C30. 
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9.5 Clocking of Memory Accesses 



This section uses the relationships between internal clock phases (H1 and H3) 
to memory accesses to illustrate how the TMS320C3x handles multiple 
memory accesses. Whereas the previous section discusses the interaction 
between sequences of instructions, this section discusses the flow of data on 
an individual instruction basis. 

Each major clock period of 60 ns is composed of two minor clock periods of 
30 ns, labeled H3 and H1 . The active clock period for H3 and H1 is the time 
when that signal is high. 



The precise operation of memory reads and writes can be defined according 
to these minor clock periods. The types of memory operations that can occur 
are program fetches, data loads and stores, and DMA accesses. 



Internal program fetches are always performed during H3 unless a single data 
store must occur at the same time due to another instruction in the pipeline. 
In this case, the program fetch occurs during H1 , and the data store during H3. 

External program fetches always start at the beginning of H3, with the address 
being presented on the external bus. At the end of H 1 , they are completed with 
the latching of the instruction word. 



Major Clock Period 



H1 




9.5.1 Program Fetches 



Pipeline Operation 9-23 



Clocking of Memory Accesses 



9.5.2 Data Loads and Stores 

Four types of instructions perform loads, memory reads, and stores: 

□ Two-operand instructions, 

□ Three-operand instructions, 

□ Multiplier/ALU operation with store instructions, and 

□ Parallel multiply and add instructions. 

See Chapter 5 for detailed information on addressing modes. 

As discussed in Chapter 7, the number of bus cycles for external memory 
accesses differs in some cases from the number of CPU execution cycles. For 
external reads, the number of bus cycles and CPU execution cycles is identi- 
cal. For external writes, there are always at least two bus cycles, but unless 
there is a port access conflict, there is only one CPU execution cycle. In the 
following examples, any difference in the number of bus cycles and CPU 
cycles is noted. 

Two-Operand Instruction Memory Accesses 

Two-operand instructions include all instructions whose bits 31-29 are 000 or 
010 (see Figure 9-2). In the case of a data read, bits 15-0 represent the $rc 
operand. Internal data reads are always performed during H1 . External data 
reads always start at the beginning of H3, with the address being presented 
on the external bus; they complete with the latching of the data word at the end 
of H1. 

Figure 9-2. Two-Operand Instruction Word 
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In the case of a data store, bits 1 5-0 represent the dst operand. Internal data 
stores are performed during H3. External data stores always start at the begin- 
ning of H3, with the address and data being presented on the external bus. 

Three-Operand Instruction Memory Reads 

Three-operand instructions include all instructions whose bits 31-29 are 001 
(see Figure 9-3). The source operands, srd and src2, come from either regis- 
ters or memory. When one or more of the source operands are from memory, 
these instructions are always memory reads. 
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Figure 9-3. Three-Operand Instruction Word 
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If only one of the source operands is from memory (either srd or src2) and is 
located in internal memory, the data is read during H1 . If the single memory 
source operand is in external memory, the read starts at the beginning of H3, 
with the address being presented on the external bus, and completes with the 
latching of the data word at the end of H1 . 

If both source operands are to be fetched from memory, several cases occur. 
If both operands are located in internal memory, the srd read is performed 
during H3 and the src2 read during H1 , thus completing two memory reads in 
a single cycle. 

If srd is in internal memory and src2 is in external memory, the src2 access 
begins at the start of H3 and latches at the end of H1 . At the same time, the 
srd access to internal memory is performed during H3. Again, two memory 
reads are completed in a single cycle. 

If srd is in external memory and src2 is in internal memory, two cycles are nec- 
essary to complete the two reads. In the first cycle, both operands are ad- 
dressed. Since srd takes an entire cycle to be read and latched from external 
memory, the internal operation on src2 cannot be completed until the second 
cycle. Ordering the operands so that srd is located internally is necessary to 
achieve single-cycle execution. 

If srd and src2 are both from external memory, two cycles are required to com- 
plete the two reads. In the first cycle, the srd access is performed and loaded 
on the next H3; in the second cycle, the src2 access is performed and loaded 
on that cycle's H1. 

If src2 is in external memory and srd is in on-chip or external memory and is 
immediately preceded by a single store instruction to external memory, a 
dummy src2 read can occur between the execution of the store instruction and 
the src2 read, regardless of which memory space is accessed (STRB, 
MSTRB, or IOSTRB). The dummy read can cause an externally interfaced 
FIFO address pointer to be incremented prematurely, thereby causing the loss 
of FIFO data. Example 9-17 illustrates how the dummy read can occur. 
Example 9-1 8 offers an alternative code segment that suppresses the dummy 
read. In the alternative code segment, the dummy read is eliminated by swap- 
ping the order of the source operands. 
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Example 9-1 7. Dummy src2 Read 



STI R0,*AR6 ; AR6 points to MSTRB space 

ADD 1 3 *AR3,*AR1,R0 ; AR3 points to on-ch ip RAM 

; AR1 points to MSTRB space 

PIPELINE OPERATION 
PC | F | D | R | E | 

n sti 

n + 1 ADDI3 STI 

n + 2 ADDI3 STI 

n + 3 - STI R0,*AR6 

The read of src2 cannot start 
n + 4 — — until the store is complete. 

n + 5 ADDI3 — dummy load of src2 

n + 6 — — second cycle of dummy load 

n + 7 ADDI3 — actual read of src2 and srcl 

n + 8 ADDI3 *AR3,*AR1,R0 



D = Decode, E = Execute, F = Fetch, R = Read, PC = Program Counter 

Two cy cles are required for the MSTRB store. Two other cycles are required for the 
dummy MST RB read of *AR3 (because the read follows a write). One cycle is required 
for an actual MSTRB read of *AR3. 
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Example 9-18. Operand Swapping Alternative 



Switch the operands of the three-operand instruction so that the internal read 
is performed first. 



STI R0,*AR6 ;AR6 points to MSTRB space 

ADD 1 3 *AR1,* AR3.R0 ;AR3 points to on-ch ip RAM 

;AR1 points to MSTRB space 



H1 
H3 



PC 
n 

n + 1 
n + 2 
n + 3 
n + 4 
n + 5 
n + 6 
n + 7 



PIPELINE OPERATION 
| D | R | 



STI 
ADD 1 3 



STI 



ADDI3 



STI 



ADDI3 



STI R0,*AR6 

— The read of src2 cannot start 
until the store is complete. 

— actual read of src2 and srcl 

— second cycle of src2 read 
ADD 1 3 *AR1,*AR3,R0 



D = Decode, E = Execute, F = Fetch, R = Read, PC * Program Counter 
Operations with Parallel Stores 

The next class of instructions includes every instruction that has a store in par- 
allel with another instruction. Bits 31 and 30 for these instructions are equal 
to1 1. 

The instruction word format for those operations that perform a multiply or ALU 
operation in parallel with a store is shown in Figure 9-4. If the store operation 
to dst2 is external or internal, it is performed during H3. Two bus cycles are 
required for external stores, but only one CPU cycle is necessary to complete 
the write. 

If the memory read operation is external, it starts at the beginning of H3 and 
latches at the end of H1. If the memory read operation is internal, it is per- 
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formed during H1 . Note that memory reads are performed by the CPU during 
the read (R) phase of the pipeline, and stores are performed during the ex- 
ecute (E) phase. 

Figure 9-4. Multiply or CPU Operation With a Parallel Store 
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The instruction word format for those instructions that have parallel stores to 
memory is shown in Figure 9-5. If both destination operands, dst1 and dst2, 
are located in internal memory, dst1 is stored during H3 and dst2 during H1, 
thus completing two memory stores in a single cycle. 

If dst1 is in external memory and dst2\s in internal memory, the dst1 store be- 
gins at the start of H3. The dst2 store to internal memory is performed during 
H 1 . Two bus cycles are required for the external store, but only one CPU cycle 
is necessary to complete the write. Again, two memory stores are completed 
in a single cycle. 

If dst1 is in internal memory and dst2\s in external memory, an additional bus 
cycle is necessary to complete the dst2 store. Only one CPU cycle is neces- 
sary to complete the write, but the port access requires three bus cycles. In the 
first cycle, the internal dst1 store is performed during H3, and dst2 is written 
to the port during H1 . During the next cycle, the dst2 store is performed on the 
external bus, beginning in H3, and executes as normal through the following 
cycle. 

If dst1 and dst2 are both written to external memory, a single CPU cycle is still 
all that is necessary to complete the stores. In this case, four bus cycles are 
required. 

1 ) In the first cycle, both dst1 and dst2 are written to the port, and the external 
bus access for dst1 begins. 

2) The store for dst1 is completed on the second cycle, and the store for dst2 
begins on the third external bus cycle. 

3) Finally, the store for dst2 is completed on the fourth external bus cycle. 
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Figure 9-5. Two Parallel Stores 
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Parallel Multiplies and Adds 

Memory addressing for parallel multiplies and adds is similar to that for three- 
operand instructions. The parallel multiplies and adds include all instructions 
whose bits 31-30 = 10 (see Figure 9-6). 

For these operations, src3 and src4 are both located in memory. If both oper- 
ands are located in internal memory, src3 is performed during H3, and src4 is 
performed during H1 , thus completing two memory reads in a single cycle. 

If src3 is in internal memory and src4 is in external memory, the src4 access 
begins at the start of H3 and latches at the end of H1 . At the same time, the 
src3 access to internal memory is performed during H3. Again, two memory 
reads are completed in one cycle. 

If src3 is in external memory and src4 is in internal memory, two cycles are nec- 
essary to complete the two reads. In the first cycle, the internal src4 access 
is performed. During the H3 of the next cycle, the src3 access is performed. 

If src3 and src4 are both from external memory, two cycles are necessary to 
complete the two reads. In the first cycle, the src3 access is performed; in the 
second cycle, the src4 access is performed. 



Figure 9-6. Parallel Multiplies and Adds 
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Chapter 10 



Assembly Language Instructions 



The TMS320C3x assembly language instruction set supports numeric-inten- 
sive, signal-processing, and general-purpose applications. The instructions 
are organized into major groups consisting of load-and-store, two- or three-op- 
erand arithmetic/logical, parallel, program-control, and interlocked operations 
instructions. The addressing modes used with the instructions are described 
in Chapter 5. 

The TMS320C3x instruction set can also use one of 20 condition codes with 
any of the 1 0 conditional instructions, such as LDFcond. This chapter defines 
the condition codes and flags. 

The assembler allows optional syntax forms to simplify the assembly language 
for special-case instructions. These optional forms are listed and explained. 

Each of the individual instructions is described and listed in alphabetical order 
(see subsection 1 0.3.2 on page 1 0-1 6). Example instructions demonstrate the 
special format and explain its content. 

This chapter discusses the following major topics: 



Topic 


Page 
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10.1 Instruction Set 

All of the instructions in the TMS320C3x instruction set are one machine word 
long. Most require one cycle to execute. All instructions are a single machine 
word long, and most instructions require one cycle to execute. In addition to 
multiply and accumulate instructions, the TMS320C3x possesses a full com- 
plement of general-purpose instructions. 

The instruction set contains 1 1 3 instructions organized into the following func- 
tional groups: 

□ Load-and-store 

□ Two-operand arithmetic/logical 

□ Three-operand arithmetic/logical 

□ Program control 

□ Interlocked operations 

□ Parallel operations 

Each of these groups is discussed in the succeeding subsections. 

10.1.1 Load-and-Store Instructions 

The TMS320C3x supports 12 load-and-store instructions (see Table 10-1). 
These instructions can: 

□ Load a word from memory into a register, 

□ Store a word from a register into memory, or 

□ Manipulate data on the system stack. 

Two of these instructions can load data conditionally. This is useful for locating 
the maximum or minimum value in a data set. See Section 1 0.2 on page 1 0-1 0 
for detailed information on condition codes. 



Table 10-1. Load-and-Store Instructions 



Instruction 


Description 


Instruction 


Description 


LDE 


Load floating-point exponent 


POP 


Pop integer from stack 


LDF 


Load floating-point value 


POPF 


Pop floating-point value from stack 


LDFcond 


Load floating-point value 
conditionally 


PUSH 


Push integer on stack 


LDI 


Load integer 


PUSHF 


Push floating-point value on stack 


LD\cond 


Load integer conditionally 


STF 


Store floating-point value 


LDM 


Load floating-point mantissa 


STI 


Store integer 


LDP 


Load data page pointer 
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10.1.2 Two-Operand Instructions 

The TMS320C3x supports 35 two-operand arithmetic and logical instructions. 
The two operands are the source and destination. The source operand can be 
a memory word, a register, or a part of the instruction word. The destination 
operand is always a register. 

As shown in Table 10-2, these instructions provide integer, floating-point, or 
logical operations, and multiprecision arithmetic. 



Table 10-2. Two-Operand Instructions 



Instruction 


Description 


Instruction 


Description 


ABSF 


Absolute value of a floating- 
point number 


NORM 


Normalize floating-point value 


ABSI 


Absolute value of an integer 


NOT 


Bitwise logical-complement 


ADDCt 


Add integers with carry 


ORT 


Bitwise logical-OR 


ADDFt 


Add floating-point values 


RND 


Round floating-point value 


ADDlt 


Add integers 


ROL 


Rotate left 


ANDt 


Bitwise logical-AND 


ROLC 


Rotate left through carry 


ANDNt 


Bitwise logical-AND with 
complement 


ROR 


Rotate right 


ASHt 


Arithmetic shift 


RORC 


Rotate right through carry 


CMPFt 


Compare floating-point values 


SUBBt 


Subtract integers with borrow 


CMPlt 


Compare integers 


SUBC 


Subtract integers conditionally 


FIX 


Convert floating-point value to 
integer 


SUBF 


Subtract floating-point values 


FLOAT 


Convert integer to floating-point 
value 


SUBI 


Subtract integer 


LSHt 


Logical shift 


SUBRB 


Subtract reverse integer with 
borrow 


MPYFt 


Multiply floating-point values 


SUBRF 


Subtract reverse floating-point 
value 


MPYlt 


Multiply integers 


SUBRI 


Subtract reverse integer 


NEQB 


Negate integer with borrow 


TSTBt 


Test bit fields 


NEGF 


Negate floating-point value 


XORt 


Bitwise exclusive-OR 


NEQI 


Negate integer 







t Two- and three-operand versions 
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10.1.3 Three-Operand Instructions 

Most instructions have only two operands; however, some arithmetic and log- 
ical instructions have three-operand versions. The 17 three-operand instruc- 
tions allow the TMS320C3x to read two operands from memory or the CPU 
register file in a single cycle and store the results in a register. The following 
factors differentiate the two- and three-operand instructions: 

□ Two-operand instructions have a single source operand (or shift count) 
and a destination operand. 

□ Three-operand instructions can have two source operands (or one source 
operand and a count operand) and a destination operand. A source oper- 
and can be a memory word or a register. The destination of a three-oper- 
and instruction is always a register. 

Table 10-3 lists the instructions that have three-operand versions. Note that 
you can omit the 3 in the mnemonic from three-operand instructions (see sub- 
section 10.3.2 on page 10-16). 



Table 10-3. Three-Operand Instructions 



Instruction 


Description 


Instruction 


Description 


ADDC3 


Add with carry 


MPYF3 


Multiply floating-point values 


ADDF3 


Add floating-point values 


MPYI3 


Multiply integers 


ADDI3 


Add integers 


OR3 


Bitwise logical-OR 


AND3 


Bitwise logical-AND 


SUBB3 


Subtract integers with borrow 


ANDN3 


Bitwise logical-AND with complement 


SUBF3 


Subtract floating-point values 


ASH3 


Arithmetic shift 


SUBI3 


Subtract integers 


CMPF3 


Compare floating-point values 


TSTB3 


Test bit fields 


CMPI3 


Compare integers 


XOR3 


Bitwise exclusive-OR 


LSH3 


Logical shift 
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10.1.4 Program-Control Instructions 

The program-control instruction group consists of all of those instructions (1 7) 
that affect program flow. The repeat mode allows repetition of a block of code 
(RPTB) or of a single line of code (RPTS). Both standard and delayed 
(single-cycle) branching are supported. Several of the program control instruc- 
tions are capable of conditional operations (see Section 11.2 on page 11-6 
for detailed information on condition codes). Table 1 0-4 lists the program con- 
trol instructions. 



Table 10-4. Program Control Instructions 



Instruction 


Description 


Instruction 


Description 


Bcond 


Branch conditionally (standard) 


IDLE 


Idle until interrupt 


BconcD 


Branch conditionally (delayed) 


NOP 


No operation 


BR 


Branch unconditionally (standard) 


RETlcond 


Return from interrupt conditionally 


BRD 


Branch unconditionally (delayed) 


RETScond 


Return from subroutine 
conditionally 


CALL 


Call subroutine 


RPTB 


Repeat block of instructions 


CALLcond 


Call subroutine conditionally 


RPTS 


Repeat single instruction 


DBcond 


Decrement and branch 
conditionally (standard) 


SWI 


Software interrupt 


DBcondD 


Decrement and branch 
conditionally (delayed) 


TRAP cone/ 


Trap conditionally 


IACK 


Interrupt acknowledge 







10.1.5 Low-Power Control Instructions 

The low-power control instruction group consists of three instructions that af- 
fect the low-power modes. The low-power idle (IDLE2) instruction allows ex- 
tremely low-power mode. The divide-clock-by- 16 (LOPOWER) instruction re- 
duces the rate of the input clock frequency. The restore-clock-to-regular- 
speed (MAXSPEED) instruction causes the resumption of full-speed opera- 
tion. Table 10-5 lists the low-power control instructions. 



Table 10-5. Low-Power Control Instructions 



Instruction 


Description 


Instruction Description 


IDLE2 


Low-power idle 


MAXSPEED Restore clock to regular speed 


LOPOWER 


Divide clock by 16 
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10.1.6 Interlocked-Operations Instructions 

The interlocked operations instructions (Table 10-6) support multiprocessor 
communication and the use of external signals to allow for powerful synchroni- 
zation mechanisms. The instructions also guarantee the integrity of the com- 
munication and result in a high-speed operation. Refer to Chapter 6 for exam- 
ples of the use of interlocked instructions. 



Table 10-6. Interlocked Operations Instructions 



Instruction 


Description 


Instruction 


Description 


LDFI 


Load floating-point value, interlocked 


STFI 


Store floating-point value, inter- 
locked 


LDII 


Load integer, interlocked 


STII 


Store integer, interlocked 


SIGI 


Signal, interlocked 
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10.1.7 Parallel-Operations Instructions 



The parallel-operations instructions group makes a high degree of parallelism 
possible. Some of the TMS320C3x instructions can occur in pairs that will be 
executed in parallel. These instructions offer the following features: 

□ Parallel loading of registers, 

□ Parallel arithmetic operations, or 

□ Arithmetic/logical instructions used in parallel with a store instruction. 

Each instruction in a pair is entered as a separate source statement. The sec- 
ond instruction in the pair must be preceded by two vertical bars (||). 
Table 10-7 lists the valid instruction pairs. 



Table 10-7. Parallel Instructions 



Mnemonic 



Description 



ABSF 
||STF 

ABSI 

II sn 

ADDF3 
||STF 

ADDI3 
II sn 

AND3 
II sn 

ASH3 

II sn 

FIX 

II sn 

FLOAT 
||STF 

LDF 
IISTF 

LDI 

II sn 

LSH3 

II sn 

MPYF3 
IISTF 

MPYI3 
IISTI 



Parallel Arithmetic with Store Instructions 

Absolute value of a floating-point number and store floating-point value 

Absolute value of an integer and store integer 

Add floating-point values and store floating-point value 

Add integers and store integer 

Bitwise logical-AND and store integer 

Arithmetic shift and store integer 

Convert floating-point to integer and store integer 

Convert integer to floating-point value and store floating-point value 

Load floating-point value and store floating-point value 

Load integer and store integer 

Logical shift and store integer 

Multiply floating-point values and store floating-point value 
Multiply integer and store integer 
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Table 10-7. Parallel Instructions (Continued) 



Mnemonic 



Description 



NEGF 
HSTF 

NEGI 

II sn 

NOT 

||STI 

OR3 

II sn 

STF 
HSTF 

STI 
IISTI 

SUBF3 
HSTF 

SUBI3 
IISTI 

XOR3 
IISTI 



LDF 
II LDF 

LDI 
IILDI 



MPYF3 
|| ADDF3 

MPYF3 
|| SUBF3 

MPYI3 
|| ADDI3 

MPYI3 
II SUBI3 



Parallel Arithmetic with Store Instructions (Concluded) 
Negate floating-point value and store floating-point value 

Negate integer and store integer 

Complement value and store integer 

Bitwise logical-OR value and store integer 

Store floating-point values 

Store integers 

Subtract floating-point value and store floating-point value 
Subtract integer and store integer 
Bitwise exclusive-OR values and store integer 
Parallel Load Instructions 

Load floating-point 
Load integer 

Parallel Multiply and Add/Subtract Instructions 

Multiply and add floating-point 

Multiply and subtract floating-point 
Multiply and add integer 
Multiply and subtract integer 
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10.1.8 Illegal Instructions 

The TMS320C3x has no illegal instruction-detection mechanism. Fetching an 
illegal (undefined) opcode can cause the execution of an undefined operation. 
Proper use of the Tl TMS320 floating-point software tools will not generate an 
illegal opcode. Only the following can cause the generation of an illegal op- 
code: 

□ Misuse of the tools 

□ An error in the ROM code 

□ Defective RAM 
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10.2 Condition Codes and Flags 

The TMS320C3X provides 20 condition codes (00000-10100, excluding 
01011) that you can place in the cone/field of any of the conditional instructions, 
such as RETScortdor LDF cond. The conditions include signed and unsigned 
comparisons, comparisons to 0, and comparisons based on the status of indi- 
vidual condition flags. Note that all conditional instructions can accept the suf- 
fix U to indicate unconditional operation. 

Seven condition flags provide information about properties of the result of 
arithmetic and logical instructions. The condition flags are stored in the status 
register (ST) and are affected by an instruction only when either of the follow- 
ing two cases occurs: 

□ The destination register is one of the extended-precision registers 
(R7-R0). (This allows for modification of the registers used for addressing 
but does not affect the condition flags during computation.) 

□ The instruction is one of the compare instructions (CMPF, CMPF3, CMPI, 
CMPI3, TSTB, or TSTB3). (This makes it possible to set the condition flags 
according to the contents of any of the CPU registers.) 

The condition flags can be modified by most instructions when either of the 
preceding conditions is established and either of the following two cases oc- 
curs: 

□ A result is generated when the specified operation is performed to infinite 
precision. This is appropriate for compare and test instructions that do not 
store results in a register. It is also appropriate for arithmetic instructions 
that produce underflow or overflow. 

□ The output is written to the destination register, as shown in Table 10-8. 
This is appropriate for other instructions that modify the condition flags. 

Table 10-8. Output Value Formats 



Type Of Operation 


Output Format 


Floating-point 


8-bit exponent, one sign bit, 31 -bit fraction 


Integer 


32-bit integer 


Logical 


32-bit unsigned integer 



Figure 10-1 on page 10-11 shows the condition flags in the low-order bits of 
the status register. Following the figure is a list of status register condition flags 
and descriptions of how the flags are set by most instructions. For specific de- 
tails of the effect of a particular instruction on the condition flags, see the de- 
scription of that instruction in subsection 10.3.3 on page 10-18. 
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Figure 10-1. Status Register 

31 30 29 26 27 26 25 24 23 22 21 20 19 18 17 16 
xxl xx I xx|xx|xx|xx|xx| XX I XX I XX I XX I XX I XX I XX I XX I XX 



15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
| xx | xx | QIE | CC | CE | CF | xx | RM | OVm| W | lV [ V? \ N | Z \ V f 0 | 

R/WR/WR/WR/W FVWR/WR/WR/WR/WR/WR/WR/WR/W 

NOTE: xx = reserved bit 

R = read, W = write 

LUF Latched Floating-Point Underflow Condition Flag 

LUF is set whenever UF (floating-point underflow flag) is set. LUF can be 
cleared only by a processor reset or by modifying it in the status register (ST). 

LV Latched Overflow Condition Flag 

LV is set whenever V (overflow condition flag) is set. Otherwise, it is un- 
changed. LV can be cleared only by a processor reset or by modifying it in the 
status register (ST). 

UF Floating-Point Underflow Condition Flag 

A floating-point underflow occurs whenever the exponent of the result is less 
than or equal to -1 28. If a floating-point underflow occurs, UF is set, and the 
output value is set to 0. UF is cleared if a floating-point underflow does not oc- 
cur. 

N Negative Condition Flag 

Logical operations assign N the state of the MSB of the output value. For inte- 
ger and floating-point operations, N is set if the result is negative, and cleared 
otherwise. Zero is positive. 

Z Zero Condition Flag 

For logical, integer, and floating-point operations, Z is set if the output is 0 and 
cleared otherwise. 
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V Overflow Condition Flag 

For integer operations, V is set if the result does not fit into the format specified 
for the destination (that is, -2 32 * result * 2 32 - 1). Otherwise, V is cleared. 
For floating-point operations, V is set if the exponent of the result is greater 
than 127; otherwise.V is cleared. Logical operations always clear V. 

C Carry Flag 

When an integer addition is performed, C is set if a carry occurs out of the bit 
corresponding to the MSB of the output. When an integer subtraction is per- 
formed, C is set if a borrow occurs into the bit corresponding to the MSB of the 
output. Otherwise, for integer operations, C is cleared. The carry flag is unaf- 
fected by floating-point and logical operations. For shift instructions, this flag 
is set to the final value shifted out; for a 0 shift count, this is set to 0. 

Table 10-9 lists the condition mnemonic, code, description, and flag for each 
of the 20 condition codes. 
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Table 10-9. Condition Codes and Flags 



wunuiuun 


V/UUe 


L/Bscnpiion 


riagi 






Unconditional Compares 




u 


00000 


Unconditional 


Don't care 






Unsigned Compares 




LO 


00001 


Lower than 


C 


LS 


00010 


Lower than or same as 


CORZ 


HI 


00011 


Higher than 


~C AND-Z 


HS 


00100 


Higher than or same as 


~C 


EQ 


00101 


Equal to 


Z 


NE 


00110 


Not equal to 


~z 






Signed Compares 




LT 


00111 


Less than 


N 


LE 


01000 


Less than or equal to 


NORZ 


GT 


01001 


Greater than 


~N AND ~Z 


GE 


01010 


Greater than or equal to 


~N 


EQ 


00101 


Equal to 


Z 


NE 


00110 


Not equal to 


~Z 






Compare to Zero 




Z 


00101 


Zero 


z 


NZ 


00110 


Not zero 


~z 


P 


01001 


Positive 


~N AND ~Z 


N 


00111 


Negative 


N 


NN 


01010 


Nonnegative 


~N 






Compare to Condition Flags 




NN 


01010 


Nonnegative 


~N 


N 


00111 


M ©native 

1 Will V w 


N 


NZ 


00110 


Nonzero 




z 


00101 


Zero 


2 


NV 


01100 


No overflow 


V 


V 


01101 


Overflow 

\^ Vvl 1 1 W WW 


V 


NUF 


01110 


No underflow 


-UF 

w 1 


UF 


01111 


Underflow 


UF 


NC 


00100 


No carry 


~C 


C 


00001 


Carry 


C 


NLV 


10000 


No latched overflow 


~LV 


LV 


10001 


Latched overflow 


LV 


NLUF 


10010 


No latched floating-point underflow 


-LUF 


LUF 


10011 


Latched floating-point underflow 


LUF 


ZUF 


10100 


Zero or floating-point underflow 


ZORUF 



t — = logical complement (not-true condition) 
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10.3 Individual Instructions 

This section contains the individual assembly language instructions for the 
TMS320C3x. The instructions are listed in alphabetical order. Information for 
each instruction includes assembler syntax, operation, operands, encoding, 
description, cycles, status bits, mode bit, and examples. 

Definitions of the symbols and abbreviations, as well as optional syntax forms 
allowed by the assembler, precede the individual instruction description sec- 
tion. Also, an example instruction shows the special format used and explains 
its content. 

A functional grouping of the instructions, as well as a complete instruction set 
summary, can be found in Section 10.1 on page 10-2. Appendix A lists the 
opcodes for all of the instructions. Refer to Chapter 5 for information on 
memory addressing. Code examples using many of the instructions are pro- 
vided in Chapter 1 1 . 

10.3.1 Symbols and Abbreviations 

Table 1 0-1 0 lists the symbols and abbreviations used in the individual instruc- 
tion descriptions. 
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Table 10-10. Instruction Symbols 



oyulDOi 


Maonlnn 

Moaning 


src 


Source operand 


srd 


Source operand 1 


src2 


Source operand 2 


src3 


Source ooerand 3 

WWMI WW WWWI Wit IW W^ 


srcA 


Source operand 4 


dst 


Destination operand 


dst\ 


Destination operand 1 


dst2 


Destination ooerand 2 

LrWWUI IVlUVI 1 WWWI Wtl IW mm 


disp 


Displacement 


cond 


Condition 


count 


Shift count 


G 


General addressing modes 


T 


Three-operand addressing modes 


r 


Parallel addressing modes 


B 


Conditional-branch addressing modes 


M 


Absolute value of x 


x — y 


Assign the value of x to destination y 


y\i 1 1 IGU If 


MantiQQA fialH /Qinn 4> irart\cin\ of * 

IVICMIUOOCi IIC7IU yOl^l 1 T IICUsllVJM^ V/l A 


x(exp) 


Exponent field of x 


op1 




||o P 2 


Operation 1 performed in parallel with operation 2 


xANDy 


Bitwise logical-AND of x and y 


xORy 


Bitwise logical-OR of x and y 


xXORy 


Bitwise logical-XOR of x and y 


~x 


Bitwise logical-complement of x 


x«y 


Shift x to the lefty bits 


x»y 


Shift x to the right y bits 


*++SP 


Increment SP and use incremented SP as address 


*SP-- 


Use SP as address and decrement SP 


ARn 


Auxiliary register n 


IRn 


Index register n 


Rn 


Register address n 


RC 


Repeat count register 


RE 


Repeat end address register 


RS 


Repeat start address register 


ST 


Status register 


C 


Carry bit 


GIE 


Global interrupt enable bit 


N 


Trap vector 


PC 


Program counter 


RM 


Repeat mode flag 


SP 


System stack pointer 



Assembly Language Instructions 1 0-1 5 



Individual Instructions 



10.3.2 Optional Assembler Syntax 

The assembler allows a relaxed syntax form for some instructions. These op- 
tional forms simplify the assembly language so that special-case syntax can 
be ignored. Following is a list of these optional syntax forms. 

□ You can omit the destination register on unary arithmetic and logical oper- 
ations when the same register is used as a source. For example, 

ABSI R0,R0 can be written as ABSI R0. 

Instructions affected: ABSI, ABSF, FIX, FLOAT, NEGB, NEGF, NEGI, 
NORM, NOT, RND 

□ You can write all three-operand instructions without the 3. For example, 

ADDI3 R0,R1 ,R2 can be written as ADDI R0,R1 ,R2. 

Instructions affected: ADDC3, ADDF3, ADDI3, AND3, ANDN3, ASH3, 
LSH3, MPYF3, MPYI3, OR3, SUBB3, SUBF3, SUBI3, XOR3 

This also applies to all of the pertinent parallel instructions. 

□ You can write all three-operand comparison instructions without the 3. For 
example, 

CMPI3 R0,*AR0 can be written as CMPI R0,*AR0. 

Instructions affected: CMPI3, CMPF3, TSTB3 

□ Indirect operands with an explicit 0 displacement are allowed. In three-op- 
erand or parallel instructions, operands with 0 displacement are automati- 
cally converted to no-displacement mode. For example: 

LDI *+AR0(0),R1 is legal. 

Also 

ADDI3 *+AR0(0),R1,R2 is equivalent to ADDI3*AR0,R1,R2. 

□ You can write indirect operands with no displacement, in which case a dis- 
placement of 1 is assumed. For example, 

LDI *AR0++(1),R0 can be written as LDI *AR0++,R0. 

□ All conditional instructions accept the suffix U to indicate unconditional op- 
eration. Also, you can omit the U from unconditional short branch instruc- 
tions. For example: 



BU label can be written as B label. 

□ You can write labels with or without a trailing colon. For example: 

labelO: NOP 
labeh NOP 

Iabel2: (Label assembles to next source line.) 
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□ Empty expressions are not allowed for the displacement in indirect mode: 
LDI *+ARO0,RO is not legal. 

□ You can precede long immediate mode operands (destination of BR and 
CALL) with an @ sign: 

BR label can be written as BR ©label. 

□ You can use the LDP pseudo-op to load a register (usually DP) with the 
eight MSBs of a relocatable address: 

LDP addr,REG or LDP ©addr.REG 

The @ sign is optional. 

If the destination REG is the DP, you can omit the DP in the operand. LDP 
generates an LDI instruction with an immediate operand and a special re- 
location type. 

□ You can write parallel instructions in either order. For example: 

ADDI can be written as STI 

|| STI || ADDI. 

□ You can write the parallel bars indicating part 2 of a parallel instruction any- 
where on the line from column 0 to the mnemonic. For example: 

ADDI can be written as ADDI 

|| STI || STI. 

□ If the second operand of a parallel instruction is the same as the third (des- 
tination register) operand, you can omit the third operand. This allows you 
to write three-operand parallel instructions that look like normal two-oper- 
and instructions. For example, 

ADDI *AR0,R2,R2 can be written as ADD *AR0,R2 
|| MPYI *AR1 ,R0,R0 || MPYI *AR1 ,R0. 

Instructions (applies to all parallel instructions that have a register second 
operand) affected: ADDI, ADDF, AND, MPYI, MPYF, OR, SUBI, SUBF, 
and XOR. 

□ You can write all commutative operations in parallel instructions in either 
order. For example, you can write the ADDI part of a parallel instruction in 
either of two ways: 

ADDI *AR0,R1,R2 or ADDI R1,*AR0,R2. 

Instructions affected: parallel instructions containing any of ADDI, ADDF, 
MPYI, MPYF, AND, OR, and XOR. 
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□ Use the syntax in Table 10-11 to designate CPU registers in operands. 
Note the alternate notation Rn, 0 < n <; 27, which is used to designate 
any CPU register. 

Table 10-11. CPU Register Syntax 





AltarnatA 




Q%#nfflv 
wy liiOA 


nv^|i9&vi wyi liciA 


Assianod Function 


R0 


RO 


Extended-precision register 


R1 


R1 


Extended-precision register 


R2 


R2 


Extended-precision register 


R3 


R3 


Extended-precision register 


DA 


DA 


txiena ©a -precis ion register 


R5 


R5 


Extended-precision register 


R6 


R6 


Extended-precision register 


R7 


R7 


Extended-precision register 


ARO 


R8 


Auxiliary register 


Am 


R9 


Auxiliary register 


AR2 


R10 


Auxiliary register 


AR3 


R11 


Auxiliary register 


AR4 


R12 


Auxiliary register 


AR5 


R13 


auxiliary register 


AR6 


R14 


Auxiliary register 


AR7 


R15 


Auxiliary reaister 


DP 


R16 


Data-page pointer 


IRO 


R17 


Index register 0 


IR1 


R18 


Index register 1 


BK 


R19 


Block-size register 


SP 


R20 


Active stack pointer 


ST 


R21 


Status register 


IE 


R22 


CPU/DMA interrupt enable 


IF 


R23 


CPU interrupt flags 


IOF 


R24 


I/O flags 


RS 


R25 


Repeat start address 


RE 


R26 


Repeat end address 


RC 


R27 


Repeat counter 



10.3.3 Individual Instruction Descriptions 

Each assembly language instruction for the TMS320C3x is described in 
this section in alphabetical order. The description includes the assembler syn- 
tax, operation, operands, encoding, description, cycles, status bits, mode bit, 
and examples. 
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Syntax INST src, dst 

or 

INST1 src2 t dstl 
|| INST2 src3, dst2 

Each instruction begins with an assembler syntax expression. You can place 
labels either before the command (instruction mnemonic) on the same line or 
on the preceding line in the first column. The optional comment field that con- 
cludes the syntax is not included in the syntax expression. Space(s) are 
required between each field (label, command, operand, and comment fields). 

The syntax examples illustrate the common one-line syntax and the two-line 
syntax used in parallel addressing. Note that the two vertical bars || that indi- 
cate a parallel addressing pair can be placed anywhere before the mnemonic 
on the second line. The first instruction in the pair can have a label, but the sec- 
ond instruction cannot have a label. 

Operation \src | — dst 

or 

\src2 1 dstl 
|| src3-+dst2 

The instruction operation sequence describes the processing that occurs 
when the instruction is executed. For parallel instructions, the operation se- 
quence is performed in parallel. Conditional effects of status register specified 
modes are listed for such conditional instructions as Bcond. 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 * n s 27) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0 * n * 27) 
or 

src2 indirect (disp = 0, 1 , IRO, IR1) 
dstl register (Rn1 , 0 * n1 * 7) 
src3 register (Rn2, 0 * n2 * 7) 
dst2 indirect (disp = 0, 1 , IRO, IR1) 

Operands are defined according to the addressing mode and/or the type of ad- 
dressing used. Note that indirect addressing uses displacements and the in- 
dex registers. Refer to Chapter 5 for detailed information on addressing. 
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Encoding 



31 



24 23 



16 15 



87 



1 1 
0 0 0 


— r— i — rn — r— 

INST 


i 

G 


— 1 — 1 1 1 

dst 


— rn — i — r 


—i — i — i — i — i — r 

src 


-i — i — r 


1 1 




or 














31 


24 23 




16 15 


87 




0 



— T — 


i i i — r— 


i i 


\ i 


—i — r— 


— n — i i i — rn — 


—\ — rn — i i i v 


1 1 


INST1||INST2 


dsn 


0 0 0 


srcQ 


dst2 


src2 



Description 



Cycles 



Status Bits 



Encoding examples are shown using general addressing and parallel addres- 
sing. The instruction pair for the parallel addressing example consists of 
INST1 and INST2. 

Instruction execution and its effect on the rest of the processor or memory con- 
tents is described. Any constraints on the operands imposed by the processor 
or the assembler are discussed. The description parallels and supplements 
the information given by the operation block. 

1 

The digit specifies the number of cycles required to execute the instruction. 

LUF Latched Floating-Point Underflow Condition Flag. 1 if a 

floating-point underflow occurs; unchanged otherwise. 

LV Latched Overflow Condition Flag. 1 if an integer or floating-point 
overflow occurs; unchanged otherwise. 

UF Floating-Point Underflow Condition Flag. 1 if a floating-point un- 
derflow occurs; 0 otherwise. 

N Negative Condition Flag. 1 if a negative result is generated; 0 other- 
wise. In some instructions, this flag is the MSB of the output. 

Z Zero Condition Flag. 1 if a 0 result is generated; 0 otherwise. For log- 
ical and shift instructions, 1 if a 0 output is generated; 0 otherwise. 

V Overflow Condition Flag. 1 if an integer or floating-point overflow oc- 
curs; 0 otherwise. 

C Carry Flag. 1 if a carry or borrow occurs; 0 otherwise. For shift instruc- 
tions, this flag is set to the value of the last bit shifted out; 0 for a shift 
count of 0. 

The seven condition flags stored in the status register (ST) are modified by the 
majority of instructions only if the destination register is R7-R0. The flags pro- 
vide information about the properties of the result or the output of arithmetic 
or logical operations. 
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Mode Bit OVM Overflow Mode Flag. In general, integer operations are affected by the 

OVM bit value (described in Table 3-2 on page 3-6). 

Example INST e98AEh,R5 

Before Instruction: 

DP = 80h 

R5 = 0766900000h = 2.30562500e+02 

Memory at 8098AEh = 5CDFh = 1 .00001 1 07e + 00 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

DP = 80h 

R5 = 0066900000h = 1 .80126953e + 00 

Memory at 8098AEh = 5CDFh = 1 .00001 107e + 00 

LUF LV UF N Z V C = 0 0 0 0 0 00 

The sample code presented in the above format shows the effect of the code 
on system pointers (for example, DP or SP), registers (for example, R1 or R5), 
memory at specific locations, and the seven status bits. The values given for 
the registers include the leading 0s to show the exponent in floating-point oper- 
ations. Decimal conversions are provided for all register and memory loca- 
tions. The seven status bits are listed in the order in which they appear in the 
assembler and simulator (see Section 1 0.2 on page 1 0-1 0 and Table 1 0-9 on 
page 10-13 for further information on these seven status bits). 
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ABSF Absolute Value of Floatin g-Point 



Syntax 

Operation 

Operands 



Encoding 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



ABSF src, dst 
\src\ -* dst 

src general addressing modes (G): 

0 0 register (Rn, 0 as n * 7) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, * 0 ns7) 



31 


24 23 


16 


15 


87 


0 


— 1 — T" 

0 0 0 


I I I I I 

0 0 0 0 0 0 


1 

G 


—| — 1 — 1 — T— 

dst 


i I i i i i i i i i i i i i i 

src 



The absolute value of the src operand is loaded into the dst register. The src 
and dst operands are assumed to be floating-point numbers. 

An overflow occurs if src (man) = 80000000h and src (exp) = 7Fh. The result 
is dst (man) = 7FFFFFFFh and dst (exp) = 7Fh. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if a floating-point overflow occurs; unchanged otherwise 

UF 0 

N 0 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if a floating-point overflow occurs; 0 otherwise 

C Unaffected 

OVM Operation is not affected by OVM bit value. 

ABSF R4,R7 

Before Instruction: 

R4 = 05C8000F971h = -9.90337307e + 27 
R7 = 07D251100AEh= 5.48527255e + 37 
LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

R4 = 05C8000F971h = -9.90337307e + 27 
R7 = 05C7FFF068Fh = 9.90337307e + 27 
LUF LV UF N Z V C = 0 0 0 0 0 00 
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Parallel ABSF and STF ABSF||STF 



Syntax 

Operation 

Operands 



ABSF src2,dst1 
|| STF src3, dst2 

\src2\^dst1 
|| $rc3^dst2 

src2 indirect (disp = 0, 1 , IRO, IR1) 
dst1 register (Rn1 , 0 * n1 * 7) 
src3 register (Rn2, 0sn2s7) 
dst2 indirect (disp = 0, 1 , IRO, IR1) 



Encoding 



31 



24 23 



16 15 



87 













1 1 1 1 1 1 1 




1 1 


0 0 10 0 


dsn 


0 0 0 


srcQ 


dsn 


src2 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



A floating-point absolute value and a floating-point store are performed in par- 
allel. All registers are read at the beginning and loaded at the end of the ex- 
ecute cycle. This means that if one of the parallel operations (STF) reads from 
a register and the operation being performed in parallel (ABSF) writes to the 
same register, STF accepts as input the contents of the register before it is mo- 
dified by the ABSF. 

If src2 and dst2 point to the same location, src2\s read before the write to dst2. 
If src3 and dst1 point to the same register, $rc3 is read before the write to dstl. 

An overflow occurs if src (man) = 80000000h and src (exp) = 7Fh. The result 
is dst (man) = 7FFFFFFFh and dst (exp) = 7Fh. 



1 

LUF 

LV 

UF 

N 

Z 

V 

c 



Unaffected 

1 if a floating-point overflow occurs; unchanged otherwise 

0 

0 

1 if a 0 result is generated; 0 otherwise 

1 if a floating-point overflow occurs; 0 otherwise 

Unaffected 



OVM Operation is not affected by OVM bit value. 

ABSF *++AR3(IRl) ,R4 
1 1 STF R4 , *-AR7 ( 1 ) 
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ABSF||STF Parallel ABSF and STF 



Before Instruction: 

AR3 = 809800h 
IR1 = OAFh 

R4 = 733C00000h = 1.79750e + 02 
AR7 = 8098C5h 

Data at 8098AFh = 58B4000h = - 6.118750e + 01 
Data at 8098C4h = Oh 

LUFLV UF N Z V C = 0 00 0 0 00 

After Instruction: 

AR3 = 8098AFh 
IR1 = OAFh 

R4 = 574C00000h = 6.118750e + 01 
AR7 = 8098C5h 

Data at 8098AFh = 58B4000h = -6. 118750a + 01 
Data at 8098C4h = 733C000h = 1 .79750e + 02 
LUF LV UF N Z V C = 0 0 0 0 0 00 



Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 
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Absolute Value of Integer ABSI 



Syntax 

Operation 

Operands 



Encoding 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 1 



ABSI src, dst 
\src\ -* dst 

src general addressing modes (G): 

0 0 any CPU register 

0 1 direct 

1 0 indirect 
1 1 immediate 

dsfany CPU register 



31 


24 23 


16 15 


87 




0 




I I I I I 


1 


l I I I 




1 1 1 1 1 1 


1 1 1 




0 0 0 


0 0 0 0 0 1 


G 


dst 




src 







The absolute value of the src operand is loaded into the dst register. The src 
and dst operands are assumed to be signed integers. 

An overflow occurs if src = 80000000h. If ST(OVM) = 1, the result is 
cfsf=7FFFFFFFh. If ST(OVM) = 0, the result is dst = 80000000h. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 0 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C Unaffected 

OVM Operation is affected by OVM bit value. 

ABSI R0,R0 
or 

ABSI RO 

Before Instruction: 

RO a OFFFFFFCBh = - 53 

After Instruction: 

RO = 035h = 53 
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ABSI Absolute Value of Integer 



Example 2 absi *ari,r3 

Before Instruction: 

AR1 =20h 
R3 = 0h 

Data at 20h = OFFFFFFCBh = - 53 

After Instruction: 

AR1 = 20h 
R3 = 35h = 53 

Data at 20h = OFFFFFFCBh = - 53 
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Parallel ABSI and STI ABSI[|STI 



Syntax 



Operation 



Operands 



Encoding 



ABSI src2 t dst1 

|| STI src3,dst2 

\src2 1 -> dst1 
|| src3-+dst2 

src2 indirect (disp = 0, 1 , IRO, IR1) 
dst1 register (Rn1 ,0*1*7) 
src3 register (Rn2, 0 * n2 * 7) 
dst2 indirect (disp = 0, 1 , IRO, IR1) 



31 




24 23 




16 15 




87 




0 


1 1 


0 0 10 1 


dsn 


0 0 0 


I 1 

src3 


1 1 1 1 1 1 V 

dst2 


I I 


1 1 1 1 
src2 





Description 



Cycles 
Status Bits 



An integer absolute value and an integer store are performed in parallel. All 
registers are read at the beginning and loaded at the end of the execute cycle. 
This means that, if one of the parallel operations (STI) reads from a register 
and the operation being performed in parallel (ABSI) writes to the same regis- 
ter, STI accepts as input the contents of the register before it is modified by the 
ABSI. 

If src2 and dst2 point to the same location, src2 '\s read before the write to dst2. 

An overflow occurs if src = 80000000h. If ST(OVM) = 1 , the result is dst = 
7FFFFFFFh. If ST(OVM) = 0, the result is dst = 80000000h. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 0 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C Unaffected 



Mode Bit 



OVM Operation is affected by OVM bit value. 



Assembly Language Instructions 1 0-27 



ABSI1 |STI Parallel ABSI and STI 



Example absi *-ar5 ( l ) , R5 

| | STI R1,*AR2 — (IR1) 

Before Instruction: 



AR5 = 8099E2h 
R5 = 0h 
R1 = 42h = 66 
AR2 = 8098FFh 
IR1 = OFh 

Data at 8099E1 h » OFFFFFFCBh = - 53 
Data at 8098FFh = 2h = 2 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 

AR5 = 8099E2h 
R5 = 35h = 53 
R1 = 42h = 66 
AR2 = 8098F0h 
IR1 = OFh 

Data at 8099E1 h = OFFFFFFCBh = - 53 

Data at 8098FFh = 42h = 66 

LUF LV UF N Z V C = 0 0 0 0 0 00 

i 1 

Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

i i 



10-28 



Add Integer With Carry ADDC 



Syntax 

Operation 

Operands 



Encoding 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



ADDC src, dst 

dst+ src+ C -* dst 

src general addressing modes (G): 

0 0 any CPU register 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst any CPU register 



31 




24 23 


16 15 


87 


0 


1 1 

0 0 0 


1 1 

0 0 


I l l 

0 1 0 


G 


I l l l 

dst 


l I I I l l I I I I I I I I I 

src 



The sum of the dst and src operands and the carry (C) flag is loaded into the 
dst register. The dst and src operands are assumed to be signed integers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C 1 if a carry occurs; 0 otherwise 

OVM Operation is affected by OVM bit value. 

ADDC R1,R5 

Before Instruction: 

R1 = 00FFFF5C25h = - 41 ,947 
R5 = 00FFFF019Eh = - 65,122 
LUF LV UF N Z V C = 0 0 0 



0 0 0 0 



After Instruction: 

R1 = 00FFFF5C25h = - 41 ,947 
R5 = 00FFFE5DC4h = - 107,068 
LUF LV UF N Z V C = 0 0 0 



0 0 0 0 



Assembly Language Instructions 1 0-29 



ADDC3 Add Integer With Carry, 3-Operand 



Syntax 

Operation 

Operands 



Encoding 



ADDC3 src2, srrf, dst 

srd + src2+ C -* dst 

srrf three-operand addressing modes (T): 
0 0 any CPU register 

0 1 indirect (disp = 0, 1 , IRO, IR1) 

1 0 any CPU register 

1 1 indirect (disp = 0, 1 , IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 any CPU register 

0 1 any CPU register 

1 0 indirect (disp = 0, 1 , IRO, IR1) 
1 1 indirect (disp = 0, 1 , IRO, IR1) 

ctefany CPU register 



31 


24 23 




16 15 




87 




0 


0 0 1 


1 1 1 1 1 

0 0 0 0 0 0 


T 


1 1 1 1 

dst 


1 1 1 1 1 1 I 

srd 


i I I I I i I 

src2 



Description 

Cycles 
Status Bits 



Mode Bit 



The sum of the srrf and src2 operands and the carry (C) flag is loaded into 
the dst register. The srrf, src2, and dst operands are assumed to be signed 
integers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

U 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C 1 if a carry occurs; 0 otherwise 

OVM Operation is affected by OVM bit value. 



10-30 



Add Integer With Carry, 3-Operand ADDC3 



Example 1 



ADDC3 *AR5++(IR0) ,R5,R2 



or 



ADDC3 R5,*AR5++(IR0) ,R2 

Before Instruction: 

AR5 = 809908h 
IR0 = 10h 
R5 = 066h = 102 
R2 = 0h 

Data at 809908h = OFFFFFFCBh = - 53 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 

After Instruction: 

AR5 = 80991 8h 
IR0 = 10h 
R5 = 066h = 102 
R2 = 032h = 50 

Data at 809908h = OFFFFFFCBh = - 53 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 



Before Instruction: 

R2 = 02BCh = 700 
R7 = 0F82h = 3970 
R0 = 0h 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 

After Instruction: 

R2 = 02BCh = 700 
R7 = 0F82h = 3970 
R0 = 0123Fh = 4671 

LUF LV UF N Z V C = 0 0 0 0 0 00 

i i 
Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 



Example 2 



ADDC3 R2, R7, R0 



Assembly Language Instructions 
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ADDF AddFloertng-Poto^ 



Syntax 

Operation 

Operands 



Encoding 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



ADDF src, dst 
dst+ src -+ dst 

src general addressing modes (G): 

0 0 register (Rn, 0 s n * 7) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0 * n as 7) 



31 


24 23 




16 


15 


87 


0 


1 1 

0 0 0 


— rn — i — i — r— 

0 0 0 0 1 1 


1 

G 


i i i i 

dst 


l 1 l l l l l l l I I l I I I 

src 



The sum of the dst and srcoperands is loaded into the dst register. The dst and 
src operands are assumed to be floating-point numbers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF 1 if a floating-point underflow occurs; unchanged otherwise 

LV 1 if a floating-point overflow occurs; unchanged otherwise 

UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if a floating-point overflow occurs; 0 otherwise 

C Unaffected 

OVM Operation is not affected by OVM bit value. 

ADDF *AR4++(IR1),R5 

Before Instruction: 

AR4 = 809800h 
IR1=12Bh 

R5 = 0579800000h = 6.23750e+01 

Data at 809800h = 86B2800h = 4.7031 250e + 02 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR4 = 80992Bh 
IR1 =12Bh 

R5 = 09052C0000h = 5.3268750e+02 

Data at 809800h = 86B2800h = 4.7031 250e + 02 

LUF LV UF N Z V C = 0 0 0 0 0 00 



10-32 



Add Floating-Point, 3-Operand ADDF3 



Syntax 

Operation 

Operands 



Encoding 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 1 



ADDF3 src2 % src1 t dst 
srd + src2-+ dst 

srd three-operand addressing modes (T): 
0 0 register (Rn1 , 0 * n1 * 7) 

0 1 indirect (disp = 0, 1 , IRO, IR1) 

1 0 register (Rn1 , 0 * n1 * 7) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, 0 * n2 * 7) 

0 1 register (Rn2, 0 * n2 * 7) 

1 0 indirect (disp = 0, 1 , IRO, IR1) 
1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, 0 as n * 7) 



31 


24 23 




16 15 




87 




0 


I 1 

0 0 1 


I I l l l 
0 0 0 0 0 1 


1 

T 


i i i i 

dst 


I I l l l l l 

srd 


I I l I l l l 

src2 



The sum of the srd and src2 operands is loaded into the cfefregister. The srd, 
src2, and dst operands are assumed to be floating-point numbers. 

1 

These condition flags are modified only if the destination register is R7-R0. 
LUF 1 if a floating-point underflow occurs; unchanged otherwise 
1 if a floating-point overflow occurs; unchanged otherwise 
1 if a floating-point underflow occurs; 0 otherwise 
1 if a negative result is generated; 0 otherwise 
1 if a 0 result is generated; 0 otherwise 
1 if a floating-point overflow occurs; 0 otherwise 
Unaffected 



LV 
UF 
N 
Z 
V 

c 

OVM 



Operation is not affected by OVM bit value. 



ADDF3 R6,R5,R1 
or 

ADDF3 R5,R6,R1 

Before Instruction: 

R6 = 086B280000h = 4.7031 250e + 02 
R5 = 0579800000h = 6.23750e+01 
R1 =0h 

LUF LV UF N Z V C = 0 0 0 0 



0 0 0 



Assembly Language Instructions 1 0-33 



ADDF3 Add Floating-Point, 3-Operand 



After Instruction: 

R6 = 086B280000h = 4.7031 250e + 02 

R5 = 0579800000h = 6.23750e + 01 

R1 = 09052C0000h = 5.32687506 + 02 

LUF LV UF N Z V C = 0 0 0 0 0 00 

Example 2 addf3 *+ar1(1) ,*ar7++(iro) ,R4 

Before Instruction: 

AR1 = 809820h 
AR7 = 8099F0h 
IRO = 8h 
R4 = 0h 

Data at 809821 h - 700F000h = 1 .28940e + 02 
Data at 8099F0h = 34C2000h = 1 .27590e + 01 
LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR1 = 809820h 
AR7 = 8099F8h 
IRO = 8h 

R4 = 070DB20000h = 1 .416953136 + 02 
Data at 809821 h = 700F000h = 1 .289406 + 02 
Data at 8099F0h = 34C2000h = 1 .275906 + 01 
LUF LV UF N Z V C = 0 0 0 0 0 00 

i i 
Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

■ ■ 
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Parallel ADDF3 and STF ADDF3| |STF 



Syntax ADDF3 src2, srrf, dst1 

|| STF src3, dst2 

Operation srrf + src2 -+ dst 1 

|| src3-+dst2 

Operands srrf register (Rn1 , 0 * n1 * 7) 

src2 indirect (disp = 0, 1 , IRO, IR1) 

dst1 register (Rn2, 0 s n2 s 7) 

src3 register (Rn3, 0 * n3 s 7) 

dst2 indirect (disp = 0, 1 , IRO, IR1) 

Encoding 



31 




24 23 




16 15 




87 




0 


1 1 


0 0 110 


cfefl 


srd 


l l 

src3 


I I I I l l I 

dst2 


1 I T 1 ! T T 

src2 



Description 



Cycles 
Status Bits 



A floating-point addition and a floating-point store are performed in parallel. All 
registers are read at the beginning and loaded at the end of the execute cycle. 
This means that if one of the parallel operations (STF) reads from a register 
and the operation being performed in parallel (ADDF3) writes to the same reg- 
ister, STF accepts as input the contents of the register before it is modified by 
the ADDF3. 

If src2 and dst2 point to the same location, src2 is read before the write to dst2. 
1 

These condition flags are modified only if the destination register is R7-R0. 

LUF 1 if a floating-point underflow occurs; unchanged otherwise 

LV 1 if a floating-point overflow occurs; unchanged otherwise 

UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if a floating-point overflow occurs; 0 otherwise 

C Unaffected 



Mode Bit OVM Operation is not affected by OVM bit value. 

Example addf3 *+ar3(ir1) ,R2,R5 

| | STF R4 , *AR2 



Assembly Language Instructions 1 0-35 



ADDF31 |STF Parallel ADDF3 and STF 



Before Instruction: 

AR3 = 809800h 
IR1 = 0A5h 

R2 = 070C800000h = 1 .40506 + 02 
R5 = Oh 

R4 = 057B400000h = 6.281 250e + 01 
AR2 = 8098F3h 

Data at 8098A5h = 733C000h = 1 .797506 + 02 
Data at 8098F3h = Oh 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR3 = 809800h 
IR1 « 0A5h 

R2 = 070C800000h = 1 .4050e+02 
R5 = 0820200000h = 3.202506 + 02 
R4 x 057B400000h = 6.2812506 + 01 
AR2 = 8098F3h 

Data at 8098A5h = 733C000h = 1 .797506 + 02 
Data at 8098F3h = 57B4000h = 6.281256 + 01 
LUF LV UF N Z V C ■ 0 0 0 0 0 00 

i i 
Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

i i 
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Add Integer ADDI 



Syntax ADDI src, dst 

Operation dst + src -* dst 

Operands src general addressing modes (G): 

0 0 any CPU register 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst any CPU register 

Encoding 



31 




24 23 


16 15 


87 


0 


0 0 0 


0 0 0 


1 I 
1 0 0 


G 


I l l l 

dst 


i ill 


src 





Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The sum of the dst and src operands is loaded into the the dst register. The 
dst and src operands are assumed to be signed integers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C 1 if a carry occurs; 0 otherwise 

OVM Operation is affected by OVM bit value. 

ADDI R3,R7 



Before Instruction: 



R3 = OFFFFFFCBh = -53 
R7 = 35h = 53 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

R3 = OFFFFFFCBh = -53 
R7 = 0h 

LUFLV UF N Z V C = 0 0 0 0 0 0 0 



Assembly Language Instructions 1 0-37 



ADD I3 Add Integer, 3-Operand 



Syntax 

Operation 

Operands 



ADDI3 <src2 >,<srrf >,<d$t > 
scd + src2~* dst 

srd three-operand addressing modes (T): 
0 0 any CPU register 

0 1 indirect (disp = 0, 1 , IRO, IR1) 

1 0 any CPU register 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 any CPU register 

0 1 any CPU register 

1 0 indirect (disp = 0, 1 , IRO, IR1) 
1 1 indirect (disp = 0, 1 , IRO, IR1) 

cfefany CPU register 



Encoding 



31 


24 23 




16 15 




87 




0 


"i — r- 

0 0 1 


— i — i — r— i — r— 

0 0 0 0 1 0 


— r— 

T 


I l I l 

dst 


i i i i i i i 

srd 


i 


i — i — i — r 

$rc2 


1 — r— 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 1 



The sum of the srd and src2 operands is loaded into the cfef register. The srd, 
src2, and dst operands are assumed to be signed integers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C 1 if a carry occurs; 0 otherwise 

OVM Operation is affected by OVM bit value. 

ADD I 3 R4 f R7 / R5 

Before Instruction: 

R4 = ODCh = 220 
R7 = 0A0h = 160 
R5 = 10h = 16 

LUF LV UF N Z V C = 0 0 0 0 0 00 



10-38 



Add Integer, 3-Operand ADDI3 



After Instruction: 

R4 = ODCh = 220 
R7 = 0A0h = 160 
R5 = 017Ch = 380 

LUF LV UF N Z V C = 0 0 0 0 0 00 

Example 2 addi3 *-ar3 ( l ) , *ar6 — (iR0),R2 

Before Instruction: 

AR3 = 809802h 
AR6 = 809930h 
IR0 = 18h 
R2 = 10h = 16 

Data at 809801 h = 2AF8h = 11 ,000 
Data at 809930h = 3A98h = 1 5,000 
LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR3 = 809802h 
AR6 = 80991 8h 
IR0 = 18h 

R2 = 06598h = 26,000 

Data at 809801 h = 2AF8h = 11 ,000 

Data at 809930h = 3A98h = 15,000 

LUF LV UF N Z V C = 0 0 0 0 0 00 



Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 



Assembly Language Instructions 1 0-39 



ADDI3HSTI Parallel ADDI3 and STI 



Syntax ADDI3 src2, srd, dst1 

|| ST! src3,dst2 

Operation srd + src2 ~* d$t1 

|| src3-+dst2 

Operands srd register (Rn1 , 0 as n1 * 7) 

src2 indirect (disp = 0, 1, IRO, IR1) 

dst 1 register (Rn2, 0 as n2 ss 7) 

src3 register (Rn3, 0 a; n3 ss 7) 

dst2 indirect (disp = 0, 1 , IRO, IR1) 

Encoding 



31 




24 23 




16 15 




87 




0 


1 1 


i i i — r— 

0 0 111 


i i 

flfsfl 


— T— 1 — 

srd 


i i 

src3 


i i i i i i i 

dst2 


l I I I w 1 I 

src2 



Description 



Cycles 
Status Bits 



Mode Bit 



An integer addition and an integer store are performed in parallel. All registers 
are read at the beginning and loaded at the end of the execute cycle. This 
means that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (ADDI3) writes to the same register, STI 
accepts as input the contents of the register before it is modified by the ADDIS. 

If src2 and ds\2 point to the same location, src2 is read before the write to d$t2. 

1 

These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 1 if a negative result is generated; 0 otherwise 
Z 1 if a 0 result is generated; 0 otherwise 
V 1 if an integer overflow occurs; 0 otherwise 
C 1 if a carry occurs; 0 otherwise 

OVM Operation is affected by OVM bit value. 



10-40 



Parallel ADDI3 and STI APPI3||STI 



Example addi3 *aro — (iro),r5,ro 

|| STI R3,*AR7 

Before Instruction: 

ARO = 80992Ch 

IRO = OCh 

R5 = ODCh = 220 

RO = Oh 

R3 = 35h = 53 

AR7 = 80983Bh 

Data at 80992Ch = 12Ch = 300 

Data at 80983Bh = Oh 

LUFLVUFNZVC = 000 0 0 00 

After Instruction: 

ARO = 809920h 

IRO = OCh 

R5 = ODCh = 220 

RO = 208h = 520 

R3 = 35h = 53 

AR7 = 80983Bh 

Data at 80992Ch = 12Ch = 300 

Data at 80983Bh = 35h = 53 

LUF LV UF N Z V C = 0 0 0 0 0 00 

i i 
Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

i i 



Assembly Language Instructions 1 0-41 



AND Bitwise Logical-AND 



Syntax 

Operands 

Operands 



Encoding 



AND src, dst 
dst AND src -+ dst 

src general addressing modes (G): 

0 0 any CPU register 

0 1 direct 

1 0 indirect 

1 1 immediate (not sign-extended) 
dst any CPU register 



31 


24 23 






16 15 


87 


0 


0 0 0 


1 1 1 1 — T— 

0 0 0 1 0 1 


Q 


l l I l 

dst 


i i I i i I i l i i i i i i i 

src 



Description The bitwise logical-AND between the dst and src operands is loaded into the 

dst register. The dst and src operands are assumed to be unsigned integers. 

Cycles 1 

Status Bits These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV Unaffected 

UF 0 

N MSB of the output. 

Z 1 if a 0 result is generated; 0 otherwise 

V 0 

C Unaffected 

OVM Operation is not affected by OVM bit value. 

AND R1,R2 

Before Instruction: 



Mode Bit 
Example 



R1 = 80h 
R2 = OAFFh 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 
After Instruction: 

R1 = 80h 
R2 = 80h 

LUF LV UF N 2 V C = 0 0 0 0 0 0 1 
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Bitwise Logical-AND, 3-Operand AND3 



Syntax 

Operation 

Operands 



Encoding 



Description 

Cycles 
Status Bits 



AND3 src2, srrt, dst 

srd AND src2-+ dst 

srd three-operand addressing modes (T): 
0 0 any CPU register 

0 1 indirect (disp = 0, 1 , IRO, IR1) 

1 0 any CPU register 

1 1 indirect (disp = 0, 1 , IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 any CPU register 

0 1 any CPU register 

1 0 indirect (disp = 0,1, IRO, IR1) 
1 1 indirect (disp = 0, 1 , IRO, IR1) 

dst any CPU register 



31 


24 23 






16 15 




87 




0 


0 0 1 


i i i i — r- 

0 0 0 0 1 1 


— 1 — 

T 


I I I I 

dst 


l l l I I I l 

srd 


I I I I 1 1 1 

src2 



The bitwise logical-AND between the srd and src2 operands is loaded into 
the destination register. The srtf, src2 t and dst operands are assumed to be 
unsigned integers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV Unaffected 

UF 0 

N MSB of the output. 

Z 1 if a 0 result is generated; 0 otherwise 

V 0 

C Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 



Assembly Language Instructions 1 0-43 



AN D3 Bitwise Logical-AND, 3-Operand 



Example 1 and3 *aro — ( iro ) , *+ari ,R4 

Before Instruction: 

ARO = 8098F4h 
IRO = 50h 
AR1 = 809951 h 
R4 = 0h 

Data at 8098F4h = 30h 
Dataat809952h = 123h 

LUFLVUFNZVC = 000 0 0 00 

After Instruction: 

ARO = 8098A4h 
IRO = 50h 
AR1 = 809951 h 
R4 = 020h 

Data at 8098F4h = 30h 
Dataat809952h = 123h 

LUF LV UF N Z V C = 0 0 0 0 0 00 

Example 2 and3 *-ars ,R7,R4 

Before Instruction: 

AR5 = 80985Ch 
R7 = 2h 
R4 = 0h 

Data at 80985Bh = OAFFh 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR5 = 80985Ch 
R7 = 2h 
R4 = 2h 

Data at 80985Bh = OAFFh 

LUF LV UF N Z V C = 0 0 0 0 0 00 



Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 
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Parallel AND3 and STI AND3HSTI 



Syntax AND3 src2, srrf, d$t1 

|| STI src3,dst2 

Operation srd AND src2 -* dst1 

|| src3 dst2 

Operands srd register (Rn1 , 0 * n1 s7) 

src2 indirect (disp = 0, 1 , IRO, IR1) 

dst1 register (Rn2, 0 *n2 *7) 

src3 register (Rn3, 0 as n3 & 7) 

dst2 indirect (disp = 0, 1 , IRO, IR1) 

Encoding 



31 




24 23 




1615 




87 




0 


— 1— 

1 1 


I I I I 

0 10 0 0 


1 1 

dst\ 


1 1 

srd 


l 1 

src3 


1 1 1 1 1 1 1 

dst2 


1 1 1 1 1 1 1 

src2 



Description 



Cycles 
Status Bits 



A bitwise logical-AND and an integer store are performed in parallel. All regis- 
ters are read at the beginning and loaded at the end of the execute cycle. This 
means that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (AND3) writes to the same register, STI 
accepts as input the contents of the register before it is modified by the AND3. 

If src2 and dst2 point to the same location, src2 \s read before the write to dst2, 

1 

These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 
LV Unaffected 
UF 0 

N MSB of the output. 

Z 1 if a 0 result is generated; 0 otherwise 

V 0 

C Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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AND31 |STI Parallel AND3 and STI 



Example and3 *+ar1(iro> ,R4,R7 

|| STI R3,*AR2 

Before Instruction: 

AR1 =8099F1h 

IRO = 8h 

R4 = 0A323h 

R7 = Oh 

R3 = 35h = 53 

AR2 = 80983Fh 

Data at 8099F9h = 5C53h 

Data at 80983Fh = Oh 

LUF LV UF N Z V C « 0 0 0 0 0 00 

After Instruction: 

AR1 =8099F1h 

R0 = 8h 

R4 = 0A323h 

R7 = 03h 

R3 = 35h = 53 

AR2 m 80983Fh 

Data at 8099F9h = 5C53h 

Data at 80983Fh = 35h = 53 

LUF LV UF N Z V C - 0 0 0 0 0 00 



Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

I , , , Ml , I I 
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Bitwise Logical- AND With Complement AN DN 



Syntax 

Operation 

Operands 



Encoding 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



ANDN src, dst 
dst AND -src-* dst 

src general addressing modes (G): 

0 0 any CPU register 

0 1 direct 

1 0 indirect 

1 1 immediate (not sign-extended) 
dst any CPU register 



31 


24 23 




16 15 


87 


0 


— 1 — 1 — 

0 0 0 


I I I I I 

0 0 0 1 1 0 


1 

G 


i I I I 

dst 


i i i i i i i i i i i i i i I 

src 



The bitwise logical-AND between the dst operand and the bitwise logical com- 
plement (-) of the src operand is loaded into the dst register. The dst and src 
operands are assumed to be unsigned integers. 

1 

These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 
Unaffected 
0 

MSB of the output. 

1 if a 0 result is generated; 0 otherwise 
0 

Unaffected 



LV 
UF 
N 
Z 
V 

c 

OVM 



Operation is not affected by OVM bit value. 

ANDN @980Ch,R2 

Before Instruction: 

DP = 80h 
R2 = 0C2Fh 

Data at 80980Ch = 0A02h 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

DP = 80h 
R2 = 042Dh 

Data at 80980Ch = 0A02h 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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ANDN3 Bitwise Logical-ANDN, 3-Operand 



Syntax 

Operation 

Operands 



Encoding 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 1 



ANDN3 src2, srcl, dst 

srd AND ~src2 -* dst 

srd three-operand addressing modes (T): 
0 0 any CPU register 

0 1 indirect (disp = 0, 1 , IRO, IR1) 

1 0 any CPU register 

1 1 indirect (disp = 0, 1 , IRO, IR1) 

src2 three-operand addressing modes (T): 

0 0 any CPU register 

0 1 any CPU register 

1 0 indirect (disp = 0, 1 , IRO, IR1) 
1 1 indirect (disp = 0, 1, IO0, IR1) 

dst register (Rn, 0 * n * 27) 



31 


24 23 






16 15 




87 




0 


0 0 1 


1 1 1 — 1 1 

0 0 0 1 0 0 


T 


I I 1 I 

dst 


I l l l ■ I I 

srd 


I I I I I I I 

src2 



The bitwise logical-AND between the srd operand and the bitwise logical 
complement (-) of the src2 operand is loaded into the dst register. The srd, 
src2, and dst operands are assumed to be unsigned integers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV Unaffected 

UF 0 

N MSB of the output. 

Z 1 if a 0 result is generated; 0 otherwise 

V 0 

C Unaffected 

OVM Operation is not affected by OVM bit value. 

ANDN3 R5,R3,R7 

Before Instruction: 

R5 = 0A02h 
R3 = 0C2Fh 
R7 = 0h 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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Bitwise Logical-ANDN, 3-Operand ANDN3 



After Instruction: 

R5 = 0A02h 
R3 = 0C2Fh 
R7 = 042Dh 

LUF LV UF N Z V C = 0 0 0 0 0 00 

Example 2 andn3 ri,*ar5++(iro) ,ro 

Before Instruction: 

R1 = OCFh 
AR5 = 809825h 
IRO = 5h 
R0 = 0h 

Data at 809825h = OFFFh 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 

R1 = OCFh 
AR5 = 80982Ah 
IRO = 5h 
RO = 0F30h 

Data at 809825h = OFFFh 

LUF LV UF N Z V C = 0 0 0 0 0 0 0 

i i 
Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

* ■ 
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ASH Arithmetic Shift 



Syntax 
Operation 



Operands 



Encoding 



ASH count, dst 

If (count * 0): 
dst « count - 

Else: 



dst 



dst» \count | -* dst 

count general addressing modes (G): 

0 0 any CPU register 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst any CPU register 



31 


24 23 




16 15 


87 


0 




i i — i — i — r- 




I I I I 




1 1 1 1 f — 1 1 1 


1 1 1 


0 0 0 


0 0 0 1 1 1 


G 


dst 




count 





Description The seven least significant bits of the count operand are used to generate the 

two's complement shift count of up to 32 bits. 

If the count operand is greater than 0, the dst operand is left-shifted by the 
value of the count operand. Low-order bits shifted in are (Milled, and high-ord- 
er bits are shifted out through the carry (C) bit. 

Arithmetic left-shift: 
C+-dst+-0 

If the count operand is less than 0, the dst operand is right-shifted by the abso- 
lute value of the count operand. The high-order bits of the dst operand are sign- 
extended as it is right-shifted. Low-order bits are shifted out through the C bit. 

Arithmetic right-shift: 
sign of dsf-* dst-+C 

If the count operand is 0, no shift is performed, and the C bit is set to 0. The 
count and dst operands are assumed to be signed integers. 

Cycles 1 
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Arithmetic Shift ASH 



Status Bits 



Mode Bit 
Example 1 



These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N MSB of the output. 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C Set to the value of the last bit shifted out. 0 for a shift count of 0. 

OVM Operation is not affected by OVM bit value. 

ASH R1,R3 

Before Instruction: 



R1 = 10h = 16 
R3 = OAEOOOh 

LUFLVUFNZVC = 000 0 0 00 
After Instruction: 



Example 2 



R1 = 10h 

R3 = OEOOOOOOOh 

LUF LV UF N Z V C = 0 1 



0 10 10 



ASH @98C3h,R5 

Before Instruction: 
DP = 80h 

R5 = 0AEC00001h 

Data at 8098C3h = 0FFE8 = -24 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 



DP = 80h 

R5 = OFFFFFFAEh 

Data at 8098C3h = 0FFE8 = -24 

LUF LV UF N Z V C = 0 0 



0 10 0 1 
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ASH3 Arithmetic Shift, 3-Operand 



Syntax 
Operation 



Operands 



ASH3 count, src, dst 



If (count* 0): 
src« count- 

Else: 



dst 



src» \count | -* dst 

count three-operand addressing modes (T): 
0 0 register (Rn2, 0 as n2 as 27) 

0 1 register (Rn2, 0 s n2 * 27) 

1 0 indirect (disp = 0, 1 , IRO, IR1) 
1 1 indirect (disp = 0, 1 , IRO, IR1) 



src three-operand addressing modes (T): 
0 0 register (Rn1 , 0 as n1 as 27) 

0 1 indirect (disp = 0, 1 , IRO, IR1) 

1 0 register (Rn1 , 0 as n1 as 27) 

1 1 indirect (disp = 0, 1 , IRO, IR1) 

dst register (Rn, 0 ss n a; 27) 

Encoding 



31 


24 23 


16 15 




87 




0 


1 1 

0 0 1 


I I I I I 

0 0 0 1 0 1 


T 


III! 

dst 


i i i i i i i 

src 


i 


1111 

count 


mr- 



Description The seven least significant bits of the count operand are used to generate the 

two's complement shift count of up to 32 bits. 

If the count operand is greater than 0, the src operand is left-shifted by the 
value of the count operand. Low-order bits shifted in are 0-filled, and high-ord- 
er bits are shifted out through the status register's C bit. 

Arithmetic left-shift: 
C «- src «- 0 

If the count operand is less than 0, the src operand is right-shifted by the abso- 
lute value of the count operand. The high-order bits of the srcoperand are sign- 
extended as they are right-shifted. Low-order bits are shifted out through the 
C (carry) bit. 

Arithmetic right-shift: 
sign of src -* src -* C 

If the count operand is 0, no shift is performed, and the C bit is set to 0. The 
count, src, and dst operands are assumed to be signed integers. 
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Arithmetic Shift, 3-Operand ASH3 



Cycles 
Status Bits 



Mode Bit 
Example 



1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N MSB of the output. 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C Set to the value of the last bit shifted out. 0 for a shift count of 0. 

OVM Operation is not affected by OVM bit value. 

ASH3 *AR3 — (1),R5,R0 

Before Instruction: 

AR3 = 809921 h 
R5 = 02B0h 
R0 = 0h 

Dataat809921h = 10h = 16 
LUF LV UF N Z V C = 0 

After Instruction: 

AR3 = 809920h 
R5 = 000002B0h 
RO - 02B00000h 
Dataat809921h = 10h = 16 
LUF LV UF N Z V C = 0 

Example ASH3 R1.R3.R5 

Before Instruction: 

R1 = 0FFFFFFF8h = - 8 
R3 = OFFFFCBOOh 
R5 = 0h 

LUF LV UF N Z V 
After Instruction: 



0 0 0 0 0 0 



0 0 0 0 0 0 



C = 0 0 0 0 0 00 



R1 = 0FFFFFFF8h = - 8 
R3 = OFFFFCBOOh 
R5 = OFFFFFFCBh 
LUF LV UF N Z V C 



= 0 0 0 1 0 0 0 



Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 
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ASH3| |STI Parallel ASH 3 and STI 



Syntax 
Operation 



Operands 



Encoding 



31 



ASH3 count, src2, dst1 
|| STI src3, dst2 

If (counffcO): 

src2« count-* dst1 

Else: 

$rc2» \couni\ dst1 
|| src3-+dst2 

count register (Rn1 , 0 s n1 * 7) 

$rc2 indirect (disp = 0, 1 , IRO, IR1) 
dst1 register (Rn2, 0 * n2 * 7) 
src3 register (Rn3, 0 * n3 * 7) 
cfef2 indirect (disp = 0, 1 , IRO, IR1) 



24 23 



1615 



87 





1111 


f 1 


1 1 


1 1 


11)1111 


1 I \ 1 1 i \ 


1 1 


0 10 0 1 


dst\ 


count 


srcQ 


dst2 


src2 



Description The seven least significant bits of the count operand register are used to gen- 

erate the two's complement shift count of up to 32 bits. 

If the count operand is greater than 0, the src2 operand is left-shifted by the 
value of the count operand. Low-order bits shifted in are O-filled, and high-ord- 
er bits are shifted out through the C bit. 

Arithmetic left-shift: 
C «- src2 «- 0 

If the count operand is less than 0, the src2 operand is right-shifted by the ab- 
solute value of the count operand. The high-order bits of the src2 operand are 
sign-extended as it is right-shifted. Low-order bits are shifted out through the 
Cbit. 

Arithmetic right-shift: 

sign of src2 —> $rc2 -* C 

If the count operand is 0, no shift is performed, and the C bit is set to 0. The 
count and dst operands are assumed to be signed integers. 

All registers are read at the beginning and loaded at the end of the execute 
cycle. This means that, if one of the parallel operations (STI) reads from a reg- 
ister and the operation being performed in parallel (ASH3) writes to the same 
register, STI accepts as input the contents of the register before it is modified 
by the ASH3. 
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Parallel ASH3 and STI ASH3||STI 



Cycles 
Status Bits 



Mode Bit 
Example 



If src2 and dst2 point to the same location, src2is read before the write to dst2. 
1 



These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 

1 if an integer overflow occurs; unchanged otherwise 
0 

MSB of the output 

1 if a 0 result is generated; 0 otherwise 
1 if an integer overflow occurs; 0 otherwise 
Set to the value of the last bit shifted out. 0 for a shift count of 0. 



LV 
UF 
N 
Z 
V 

c 

OVM 



Operation is not affected by OVM bit value. 



ASH3 Rl , *AR6++ ( IR1 ) , R0 
| | STI R5 , *AR2 

Before Instruction: 

AR6 = 809900h 
IR1 = 8Ch 

R1 = 0FFE8h = - 24 
R0 = 0h 
R5 = 35h = 53 
AR2 = 8098A2h 

Data at 809900h = OAEOOOOOOh 
Data at 8098A2h = Oh 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR6 = 80998Ch 
IR1 = 8Ch 

R1 = 0FFE8h = - 24 
R0 = OFFFFFFAEh 
R5 = 35h = 53 
AR2 = 8098A2h 

Data at 809900h = OAEOOOOOOh 

Data at 8098A2h = 35h = 53 

LUF LV UF N Z V C = 0 0 0 1 0 0 0 



Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 
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Bcond w Branch^ 



Syntax 
Operation 



Operands 



Encoding 



31 



Bcond src 

If cond is true: 

If src is in register-addressing mode (Rn, 0 * n * 27), 
src -* PC. 

If src is in PC-relative mode (label or address), 

displacement + PC + 1 -> PC. 
Else, continue. 

src conditional-branch addressing modes (B): 

0 register 

1 PC-relative 



o 1 1 



i — i — r- 

o 1 o 



24 23 



l l 

0 0 0 



t — i — r 

cond 



16 15 



T 



t — r 



87 

-rn 



i — i — i — rn — i — i — i — r 

register or displacement 



Description 



Bcond signifies a standard branch that executes in four cycles. A branch is per- 
formed if the condition is true (since a pipeline flush also occurs on a true condi- 
tion; see Section 9.2 on page 9-4). If the src operand is expressed in register 
addressing mode, the contents of the specified register are loaded into the PC. 
If the src operand is expressed in PC-relative mode, the assembler generates 
a displacement: displacement = label - (PC of branch instruction + 1 ) . This dis- 
placement is stored as a 16-bit signed integer in the 16 least significant bits 
of the branch instruction word. This displacement is added to the PC of the 
branch instruction plus 1 to generate the new PC. 

The TMS320C3x provides 20 condition codes that you can use with this in- 
struction (see Table 1 0-9 on page -1 3 for a list of condition mnemonics, condi- 
tion codes and flags). Condition flags are set on a previous instruction only 
when the destination register is one of the extended-precision registers (R7- 
R0) or when one of the compare instructions (CMPF, CMPF3, CMPI, CMPI3, 
TSTB, or TSTB3) is executed. 



Cycles 
Status Bits 



4 




LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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1 

Branch Conditionally (Standard) Bcond 



Example bz ro 

Before Instruction: 

PC = 2B00h 
RO = 0003FF00h 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

PC = 3FF00h 
RO = 0003FF00h 

LUF LV UF N Z V C = 0 0 0 0 0 00 



Note: 

If a BZ instruction is executed immediately following a RND instruction with 

a 0 operand, the branch is not performed, because the 0 flag is not set. To 

circumvent this problem, execute a BZUF instead of a BZ instruction, 
i < i 
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BcondD Branch Conditionally (Delayed) 



Syntax 
Operation 



Operands 
Encoding 



31 



Description 



Cycles 
Status Bits 



BcondD src 

If cond is true: 

If src is in register-addressing mode (Rn, 0 * n * 27), 
src -* PC. 

If src is in PC-relative mode (label or address), 

displacement + PC + 3 -* PC. 
Else, continue. 

src conditional-branch addressing modes (B): 

0 register 

1 PC-relative 



t — rn — i — i — 

0 11 0 1 0 



24 23 



i — r 

ooo 



-i — i — i — r 

cond 



16 15 



87 



"i — i — i — i — i — i — i — i — i — i — i — r 

register or displacement 



Bcond D signifies a delayed branch that allows the three instructions after the 
delayed branch to be fetched before the PC is modified. The effect is a single- 
cycle branch, and the three instructions following Bcond D will not affect the 
cond. 

A branch is performed if the condition is true. If the src operand is expressed 
in register-addressing mode, the contents of the specified register are loaded 
into the PC. If the srcoperand is expressed in PC-relative mode, the assembler 
generates a displacement: displacement = label - (PC of branch instruction 
+ 3). This displacement is stored as a 1 6-bit signed integer in the 1 6 least sig- 
nificant bits of the branch instruction. This displacement is added to the PC of 
the branch instruction plus 3 to generate the new PC. The TMS320C3x pro- 
vides 20 condition codes that you can use with this instruction (see Table 1 0-9 
on page -1 3 for a list of condition mnemonics, condition codes, and flags) . Con- 
dition flags are set on a previous instruction only when the destination register 
is one of the extended-precision registers (R7-R0) or when one of the com- 
pare instructions (CMPF, CMPF3, CMPI, CMPI3, TSTB, or TSTB3) is ex- 
ecuted. 

I 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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Branch Conditionally (Delayed} BcondD 



Example bnzd 36 (36 = 24h) 

Before Instruction: 

PC = 50h 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 
PC = 77h 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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BR Branch Unconditionally (Standard) 



Syntax 
Operation 
Operands 
Encoding 



BR src 

src -* PC or PC + disp PC, where disp = src - (PC + 1) 
src long-immediate addressing mode 



31 



I I I I I I 

0 1 1 0 0 0 0 



24 23 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



16 15 



87 



T — I — 111) — I — I — I — I — I I I I I I I I I I I 

disp 



BR performs a PC-relative branch that executes in four cycles, since a pipeline 
flush also occurs upon execution of the branch; see Section 9.2 on page 9-4. 
An unconditional branch is performed. The src operand is assumed to be a 
24-bit unsigned integer. Note that bit 24 = 0 for a standard branch. 



4 




LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



OVM Operation is not affected by OVM bit value. 
BR 805Ch 
Before Instruction: 
PC = 80h 

LUF LV UF N Z V C = 0 0 0 0 0 00 



After Instruction: 

PC = 805Ch 

LUF LV UF N Z V C 



0 0 0 0 0 0 0 
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Branch Unconditionally (Delayed) BRD 



Syntax 
Operation 
Operands 
Encoding 



BRD src 

src PC 

src long-immediate addressing mode 



31 




24 23 


16 15 




87 


0 




i i i — r— 




I l l I 


—i — i — i — i — i — r 


— i — i — i — r 


— 1 — 1 — 1 — IT 


1 1 1 1 


0 1 1 


0 0 0 0 


1 






src 







Description 



Cycles 
Status Bits 



Mode Bit 
Example 



BRD signifies a delayed branch that allows the three instructions after the 
delayed branch to be fetched before the PC is modified. The effect is a 
single-cycle branch. 

An unconditional branch is performed. The src operand is assumed to be a 
24-bit unsigned integer. Note that bit 24 = 1 for a delayed branch. 

1 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



OVM Operation is not affected by OVM bit value. 

BRD 2Ch 

Before Instruction: 
PC = 1Bh 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 
PC = 2Ch 

LUF LV UF N Z V C = 0 0 0 0 0 00 



Assembly Language Instructions 1 0-61 



CALL Call Subroutine 



Syntax 
Operation 

Operands 
Encoding 



CALL src 

Next PC - *++SP 
src PC 

src long-immediate addressing mode 



31 




24 23 




16 15 


87 


0 


— T — 1 — 1 


I \ 1 1 




i i 


i i i 


1 — 1 1 1 — 1 1 1 1 — 1 


1 — 1 — 1 — 1 — 1 — 1 — 1 


1 1 1 


0 1 1 


0 0 0 1 


0 






src 







Description 

Cycles 
Status Bits 



Mode Bit 
Example 



A call is performed. The next PC value is pushed onto the system stack. The 
src operand is loaded into the PC. The src operand is assumed to be a 24-bit 
unsigned immediate operand. 



4 




LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



OVM Operation is not affected by OVM bit value. 

CALL 123456h 

Before Instruction: 

PC = 5h 

SP = 809801 h 

LUFLVUFNZVC = 000 0 0 00 

After Instruction: 

PC = 123456h 
SP = 809802h 
Data at 809802h = 6h 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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Call Subroutine Conditionally CALLcond 



Syntax 
Operation 



Operands 



Encoding 



31 



CALLcond src 

If cond is true: 
Next PC *++SP 

If src is in register addressing mode (Rn, 0 * n * 27), 
src -* PC. 

If src is in PC-relative mode (label or address), 
displacement + PC + 1 -* PC. 

Else, continue. 

src conditional-branch addressing modes (B): 

0 register 

1 PC-relative 



i i i 

0 1110 0 



24 23 



0 0 0 0 



"i — i — r 

cond 



16 15 



i — i — i — i — r 



87 

1 — i — r 



t — r 



i — r 



register or displacement 



Description 



Cycles 
Status Bits 



A call is performed if the condition is true. If the condition is true, the next PC 
value is pushed onto the system stack. If the src operand is expressed in regis- 
ter addressing mode, the contents of the specified register are loaded into the 
PC. If the src operand is expressed in PC-relative mode, the assembler gener- 
ates a displacement: displacement = label - (PC of call instruction + 1). This 
displacement is stored as a 1 6-bit signed integer in the 1 6 least significant bits 
of the call instruction word. This displacement is added to the PC of the call 
instruction plus 1 to generate the new PC. 

The TMS320C3x provides 20 condition codes that can be used with this in- 
struction (see Table 1 0-9 on page -1 3 for a list of condition mnemonics, condi- 
tion codes, and flags). Condition flags are set on a previous instruction only 
when the destination register is one of the extended-precision registers (R7- 
R0) or when one of the compare instructions (CMPF, CMPF3, CMPI, CMPI3, 
TSTB, or TSTB3) is executed. 



5 




LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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CALLcond Call Subroutine Conditionally 



Example callnz R5 

Before Instruction: 

PC = 123h 
SP = 809835h 
R5 = 789h 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 

PC = 789h 
SP = 809836h 
R5 = 789h 

Data at 809836h = 124h 

LUFLVUFNZVC = 000 0 0 00 
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Compare Floating-Point CMPF 



Syntax 

Operation 

Operands 



Encoding 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



CMPF src, dst 
dst -src 

src general addressing modes (G): 

0 0 register (Rn, 0 * n * 7) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0 * n * 7) 



31 


24 23 


16 15 


87 


0 


0 0 0 


0 0 1 0 0 0 


i 

Q 


I I I I 

dst 


src 



The src operand is subtracted from the dst operand. The result is not loaded 
into any register, thus allowing for nondestructive compares. The dst and src 
operands are assumed to be floating-point numbers. 

1 

These condition flags are modified for all destination registers (R27-R0). 

LUF 1 if a floating-point underflow occurs; unchanged otherwise 

LV 1 if a floating-point overflow occurs; unchanged otherwise 

UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if a floating-point overflow occurs; 0 otherwise 

C Unaffected 

OVM Operation is not affected by OVM bit value. 

CMPF *+AR4,R6 

Before Instruction: 
AR4 = 8098F2h 

R6 = 070C800000h = 1.4050e+02 

Data at 8098F3h = 070C8000h = 1 .4050e + 02 

LUF LV UF N Z V C = 0 0 0 0 0 0 0 

After Instruction: 

AR4 = 8098F2h 

R6 = 070C800000h = 1 .4050e + 02 

Data at 8098F3h = 070C8000h = 1 .4050e + 02 

LUF LV UF N Z V C = 0 0 0 0 1 0 0 
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CM PF3 ^Compare Floating -Point, 3-Operancl 



Syntax 

Operation 

Operands 



Encoding 



CMPF3 src2, srd 
srrf ~-src2 

srrf three-operand addressing modes (T): 
0 0 register (Rn1 , 0 * n1 * 7) 

0 1 indirect (disp = 0, 1 , IRO, IR1) 

1 0 register (Rn1 , 0 * n1 s 7) 

1 1 indirect (disp = 0, 1 , IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, 0 * n2 * 7) 

0 1 register (Rn2, 0 * n2 * 7) 

1 0 indirect (disp = 0, 1, IRO, IR1) 
1 1 indirect (disp = 0,1, IRO, IR1) 



31 


24 23 


16 15 




87 




0 


1 1 

0 0 1 


1 1 1 1 I 

0 0 0 1 1 0 


1 

T 


iiii 

0 0 0 0 0 


1 1 1 1 1 1 1 

srd 


l I l I l I I 

src2 



Description 



Cycles 
Status Bits 



The src2operand is subtracted from the srrf operand. The result is not loaded 
into any register, thus allowing for nondestructive compares. The srrf and 
src2 operands are assumed to be floating-point numbers. Although this in- 
struction has only two operands, it is designated as a three-operand instruc- 
tion because operands are specified in the three-operand format. 

1 

These condition flags are modified for all destination registers (R27-R0). 

LUF 1 if a floating-point underflow occurs; unchanged otherwise 

LV 1 if a floating-point overflow occurs; unchanged otherwise 

UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if a floating-point overflow occurs; 0 otherwise 

C Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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Compare Floating-Point, 3-Operand CMPF3 



Example cmpf3 *ar2 , *ar3 — ( 1 ) 

Before Instruction: 

AR2 = 809831 h 
AR3 = 809852h 

Data at 809831 h = 77A7000h = 2.5044e + 02 
Data at 809852h = 57A2000h = 6.2531 25e + 01 
LUFLVUFNZVC = 000 0 0 00 

After Instruction: 

AR2 = 809831 h 
AR3 = 809851 h 

Data at 809831 h = 77A7000h = 2.5044e + 02 
Data at 809852h = 57A2000h = 6.2531 25e + 01 
LUF LV UF N Z V C = 0 0 0 1 0 0 0 

i 1 

Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

■ ■ 
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C M P l Compare Integer 



Syntax 

Operation 

Operands 



Encoding 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



CMPl src, dst 
dst -src 

src general addressing modes (G): 

0 0 register (Rn, 0 s n * 27) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0 * n * 27) 



31 


24 23 




16 15 


87 


0 


1 1 

0 0 0 


1 1 1 1 1 

0 0 1 0 0 1 


— 1 — 

G 


I I 1 1 

dst 


i i i i 


— i — i i i i — i — r— 

src 


1 \ 1 1 



The src operand is subtracted from the dst operand. The result is not loaded 
into any register, thus allowing for nondestructive compares. The dst and src 
operands are assumed to be signed integers. 

1 

These condition flags are modified for all destination registers (R27-R0). 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C 1 if a borrow occurs; 0 otherwise 

OVM Operation is not affected by OVM bit value. 

CMPl R3,R7 

Before Instruction: 



R3 = 898h = 2200 
R7 = 3E8h = 1000 
LUF LV UF N Z V C 

After Instruction: 

R3 = 898h = 2200 
R7 = 3E8h = 1000 
LUF LV UF N Z V C 



= 0000000 



= 0 0 0 1 0 0 1 



10-68 



Compare In teger, 3- Operand CMPI3 



Syntax 

Operation 

Operands 



Encoding 



CMPI3 src2, $rd 
srrf -src2 

srd three-operand addressing modes (T): 
0 0 register (Rn1 , 0 & n1 * 27) 

0 1 indirect (disp = 0, 1 , IRO, IR1) 

1 0 register (Rn1 , 0 s n1 * 27) 

1 1 indirect (disp = 0, 1 , IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, 0 s n2 * 27) 

0 1 register (Rn2, 0 * n2 s 27) 

1 0 indirect (disp = 0, 1 , IRO, IR1) 
1 1 indirect (disp = 0, 1, IRO, IR1) 



31 


24 23 


16 15 




87 




0 


i i 
0 0 1 


I I I I I 

0 0 0 1 1 1 


1 

T 


I I I I 
0 0 0 0 0 


1 1 1 1 1 1 1 

srd 


I I I I I I I 
src2 



Description 



Cycles 
Status Bits 



The src2 operand is subtracted from the srd operand. The result is not loaded 
into any register, thus allowing for nondestructive compares. The srd and 
src2 operands are assumed to be signed integers. Although this instruction 
has only two operands, it is designated as a three-operand instruction be- 
cause operands are specified in the three-operand format. 

1 

These condition flags are modified for all destination registers (R27-R0). 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C 1 if a borrow occurs; 0 otherwise 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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CMPI3 Compare Integer, 3-Operand 



Example cmpi3 R7,R4 

Before Instruction: 



R7 = 03E8h = 1000 
R4 = 0898h = 2200 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

R7 = 03E8h = 1000 
R4 = 0898h = 2200 

LUF LV UF N Z V C = 0 0 0 0 0 00 



Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 
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Decrement and Branch Conditionally (Standard) DBcond 



Syntax 
Operation 



Operands 



Encoding 



31 



Description 



DBcond ARn, src 

ARn - 1 ARn 

If cond is true and ARn st 0 : 

If src is in register addressing mode (Rn, 0 ss n ss 27), 
src -* PC. 

If src is in PC-relative mode (label or address), 

displacement + PC + 1 PC. 
Else, continue. 

src conditional-branch addressing modes (B): 

0 register 

1 PC-relative 

ARn register (0 * n s 7) 



24 23 



0 11 0 11 



-i — r 

ARn 



i — i — i — r 

cond 



16 15 



87 



1 — I — I — I — I — I — I — I — I 1 1 — I — I I I 

register or displacement 



DBcond signifies a standard branch that executes in four cycles because the 
pipeline must be flushed if cond is true. The specified auxiliary register is de- 
cremented and a branch is performed if the condition is true and the specified 
auxiliary register is greater than or equal to 0. The condition flags are those set 
by the last previous instruction that affects the status bits. 

The auxiliary register is treated as a 24-bit signed integer. The most significant 
eight bits are unmodified by the decrement operation. The comparison of the 
auxiliary register uses only the 24 least significant bits of the auxiliary register. 
Note that the branch condition does not depend on the auxiliary register decre- 
ment. 

If the src operand is expressed in register addressing mode, the contents of 
the specified register are loaded into the PC. If the src operand is expressed 
in PC-relative addressing mode, the assembler generates a displacement: 
displacement = label - (PC of branch instruction + 1 ). This integer is stored as 
a 16-bit signed integer in the 16 least significant bits of the branch instruction 
word. This displacement is added to the PC of the branch instruction plus 1 to 
generate the new PC. 

The TMS320C3x provides 20 condition codes that can be used with this in- 
struction (see Table 1 0-9 on page -1 3 for a list of condition mnemonics, condi- 
tion codes, and flags). Condition flags are set on a previous instruction only 
when the destination register is one of the extended-precision registers 
(R0-R7) or when one of the compare instructions (CMPF, CMPF3, CMPI, 
CMPI3, TSTB, or TSTB3) is executed. 
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DBcond Decrement and Branch Conditionally (Standard) 



Cycles 
Status Bits 



Mode Bit 
Example 



LUF 

LV 

UF 

N 

Z 

V 

c 



Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 



OVM Operation is not affected by OVM bit value. 

CMPI 200, R3 
DBLT AR3,R2 



Before Instruction: 

PC = 5Fh 
AR3 = 12h 
R2 = 9Fh 
R3 = 80h 

LUF LV UF N Z V C = 0 



0 0 1 0 0 0 



After Instruction: 

PC = 9Fh 
AR3 = 11h 
R2 = 9Fh 
R3 = 80h 

LUF LV UF N Z V C = 0 



0 0 1 0 0 0 
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Decremen t ancf Bran ch Conditio nally ^Delayed^ D BCO n d D 



Syntax 
Operation 



Operands 



Encoding 



31 



DBcondD ARn, src 

ARn - 1 ARn 

If cond is true and ARN * 0: 

If src is in register addressing mode (Rn, 0 * n * 27) 
src -* PC 

If src is in PC-relative mode (label or address) 
displacement + PC + 3 -> PC. 

Else, continue. 

src conditional-branch addressing modes (B): 

0 register 

1 PC-relative 

ARn register (0 ss n <: 7) 



i i i i 

0 11 0 11 



24 23 



ARn 



t — r 



16 15 



cond 



T 



T 



T 



87 



register or displacement 



Description DBcond D signifies a delayed branch that allows the three instructions after 

the delayed branch to be fetched before the PC is modified. The effect is a 
single-cycle branch. The specified auxiliary register is decremented, and a 
branch is performed if the condition is true and the specified auxiliary register 
is greater than or equal to 0. The condition flags are those set by the last pre- 
vious instruction that affects the status bits. The three instructions following the 
DBcondD do not affect the cond. 

The auxiliary register is treated as a 24-bit signed integer. The most significant 
eight bits are unmodified by the decrement operation. The comparison of the 
auxiliary register uses only the 24 least significant bits of the auxiliary register. 
Note that the branch condition does not depend on the auxiliary register decre- 
ment. 

If the src operand is expressed in register-addressing mode, the contents of 
the specified register are loaded into the PC. If the src is expressed in PC-rela- 
tive addressing, the assembler generates a displacement: displacement = la- 
bel - (PC of branch instruction + 3). This displacement is added to the PC of 
the branch instruction plus 3 to generate the new PC. Note that bit 21 = 1 for 
a delayed branch. 
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DBcondD 



Cycles 
Status Bits 



Mode Bit 
Example 



The TMS320C3x provides 20 condition codes that you can use with this in- 
struction (see Table 10-9 on page 10-13 for a list of condition mnemonics, 
condition codes, and flags). Condition flags are set on a previous instruction 
only when the destination register is one of the extended-precision registers 
(R7-R0) or when one of the compare instructions (CMPF, CMPF3, CMPI, 
CMPI3, TSTB, or TSTB3) is executed. 



1 

LUF 

LV 

UF 

N 

Z 

V 

c 



Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 



OVM Operation is not affected by OVM bit value. 

CMPI 26h,R2 
DBZD AR5 r $+110h 

Before Instruction: 

PC = 100h 
R2 = 26h 
AR5 = 67h 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

PC = 21 Oh 
R2 = 26h 
AR5 = 66h 

LUF LV UF N Z V C = 0 0 0 0 1 0 0 
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i-Point-to-lnteger Conversion FIX 



Syntax FIX src, dst 

Operation fix (src) dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 * n * 7) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst any CPU register 

Encoding 



31 


24 23 






16 15 


87 


0 


— T — T— 

0 0 0 


— 1 — 1 — 1 — 1 — |— 

0 0 10 1 0 


— 1 — 

G 


I l l l 

dst 


I I I I I i I I I i i I i i i 

src 



Description 



Cycles 
Status Bits 



The floating-point operand src is converted to the nearest integer less than or 
equal to it in value, and the result is loaded into the dst register. The src oper- 
and is assumed to be a floating-point number and the dst operand a signed 
integer. 

The exponent field of the result register (if it has one) is not modified. 

Integer overflow occurs when the floating-point number is too large to be rep- 
resented as a 32-bit two's complement integer. In the case of integer overflow, 
the result will be saturated in the direction of overflow. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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FIX Floating-Point-to-lnteger Conversion 



Example fix ri,r2 

Before Instruction: 

R1 = 0A28200000h = 1 .3454e + 3 
R2 = 0h 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

R1 = 0A28200000h = 134546 + 3 
R2 = 541h = 1345 

LUF LV UF N Z V C = 0 0 0 0 0 00 



10-76 



Parallel FIX and STI F1XHSTI 



Syntax FIX src2, dst1 

|| STI src3, d$t2 

Operation f\x(src2) -* dst1 

|| src3-+dst2 

Operands src2 indirect (disp = 0, 1 , IRO, IR1) 

dst 1 register (Rn1 , 0 & n1 * 7) 

src3 register (Rn2, 0 s: n2 * 7) 

dst2 indirect (disp = 0, 1 , IRO, IR1) 

Encoding 



31 




24 23 




16 15 




87 




0 


1 1 


r i t i "™ 

0 10 10 


i i 

dsn 


i i 

0 0 0 


I I 

srcQ 


i i i i i i i 

dst2 


I I I I I I I 

src2 



Description 



Cycles 
Status Bits 



A floating-point to integer conversion is performed. All registers are read at the 
beginning and loaded at the end of the execute cycle. This means that, if one 
of the parallel operations (STI) reads from a register, and the operation being 
performed in parallel (FIX) writes to the same register, STI accepts as input the 
contents of the register before it is modified by FIX. 

If src2 and d$t2 point to the same location, src2\$ read before the write to d$t2. 

Integer overflow occurs when the floating-point number is too large to be rep- 
resented as a 32-bit two's complement integer. In the case of integer overflow, 
the result will be saturated in the direction of overflow. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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FIX||STI Parallel FIX and STI 



Example fix *++ar4(1),ri 

| | STI R0,*AR2 

Before Instruction: 

AR4 = 8098A2h 
R1 =0h 

RO = ODCh = 220 
AR2 = 80983Ch 

Data at 8098A3h = 733C000h = 1 .7950e + 02 
Data at 80983Ch = Oh 

LUF LV UF N Z V C - 0 0 0 0 0 00 
After Instruction: 

AR4 = 8098A3h 
R1 =0B3h = 179 
RO = ODCh = 220 
AR2 = 80983Ch 

Data at 8098A3h = 733C000h = 1 .79750e + 02 

Data at 80983Ch = ODCh = 220 

LUF LV UF N Z V C = 0 0 0 0 0 00 

i i 
Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 
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Integer-to-Floating-Point Conversion 



FLOAT 



Syntax 

Operation 

Operands 



Encoding 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



FLOAT src, dst 
float (src) -* dst 

src general addressing modes (G): 

0 0 register (Rn, 0 n * 27) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0 * n * 7) 



31 


24 23 






16 15 


87 


0 


1 1 

0 0 0 


i i i i i 

0 0 10 11 


1 

G 


I I I I 

dst 


I I I I I I I I I I I I i I I 

src 



The integer operand src is converted to the floating-point value equal to it, and 
the result loaded into the dst register. The src operand is assumed to be a 
signed integer, and the dst operand a floating-point number. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV Unaffected 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 0 

C Unaffected 

OVM Operation is not affected by OVM bit value. 

FLOAT *++AR2(2) ,R5 

Before Instruction: 

AR2 = 809800h 

R5 = 034C2000h = 1 .275781 25e + 01 

Data at 809802h = OAEh = 174 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR2 = 809802h 

R5 = 072E00000h = 1 .74e + 02 
Data at 809802h = OAEh = 174 
LUF LV UF N Z V C = 0 0 0 0 0 00 
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FLOATIISTF Parallel FLOAT and STF 



Syntax 

Operation 

Operands 

Encoding 



FLOAT src2,dst1 
|| STF src3, dst2 

1\oa\(src2 ) -+ dst1 
|| src3 -* dst2 

src2 indirect (disp = 0, 1 , IRO, IR1) 
dst 1 register (Rn1 , 0 * n1 * 7) 
$rc3 register (Rn2, 0 * n2 3 7) 
dst2 register (disp = 0, 1 , IRO, IR1) 



31 



24 23 



16 15 



87 





I I 1 1 


"'! 1 


" r i - 


— ( — , — 


— \ — I — 1 — 1 — 1 — 1 — 1 — 


—1 — 1 — 1 — 1 — 1 — I — 1— 


1 1 


0 10 11 


dsn 


0 0 0 


srcQ 


dst2 


src2 



Description 



Cycles 
Status Bits 



Mode Bit 



An integer to floating-point conversion is performed. All registers are read at 
the beginning and loaded at the end of the execute cycle. This means that if 
one of the parallel operations (STF) reads from a register and the operation 
being performed in parallel (FLOAT) writes to the same register, then STF ac- 
cepts as input the contents of the register before it is modified by FLOAT. 

If src2 and dst2 point to the same location, src2 is read before the write to dst2. 

1 

These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 
LV Unaffected 
UF 0 

N 1 if a negative result is generated; 0 otherwise 
Z 1 if a 0 result is generated; 0 otherwise 
V 0 

C Unaffected 

OVM Operation is affected by OVM bit value. 
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Parallel FLOAT and STF FLOAT||STF 



Example float *+ar2(iro> ,R6 

| | STF R7,*AR1 

Before Instruction: 

AR2 = 8098C5h 
IRO = 8h 
R6 = 0h 

R7 = 034C200000h = 1 .275781 25e + 01 
AR1 = 809933h 

Data at 8098CDh = OAEh = 1 74 
Data at 809933h = Oh 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 

AR2 = 8098C5h 
IRO = 8h 

R6 = 072E000000h = 1 .740e + 02 

R7 = 034C200000h = 1 .275781256 + 01 

AR1 = 809933h 

Data at 8098CDh = OAEh « 174 

Data at 809933h = 034C2000h = 1 .275781 25e + 01 

LUF LV UF N Z V C = 0 0 0 0 0 00 

i i 
Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

■ » 
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IACK Interr^ 



Syntax 
Operation 

Operands 
Encoding 



IACK src 

Perform a dummy read operation with IACK = 0. 
At end of dummy read, set IACK to 1 . 

src general addressing modes (G): 
0 1 direct 
1 0 indirect 



31 


24 23 


16 15 


87 


0 


0 0 0 


i i i i i 

11 0 110 


1 

G 


I I I I 

0 0 0 0 0 


1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 

src 



Description 



Cycles 
Status Bits 



A dummy read operation is performed. If off-chip memory is specified, IACK 
is set to 0 at half H 1 cycle after the beginning of the decode pha se of t he IACK 
instruction. At the first half of the H1 cycle of the dummy read, IACK is set to 
1 . Because of a multicycle read, the IACK signal will not be extended. This in- 
struction can be used to generate an external interrupt acknowledge. The 
IACK signal and the address can be used to signal interrupt acknowledge to 
external devices. The data read by the processor is unused. 

1 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



Mode Bit 
Example 



OVM Operation is not affected by OVM bit value. 

IACK *AR5 

Before Instruction: 



IACK = 1 
PC = 300h 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 



IACK = 1 
PC = 301 h 

LUFLVUFNZVC = 000 0 0 00 
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Idle Until Interrupt IDLE 



Syntax 
Operation 

Operands 
Encoding 



31 



0 0 0 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



IDLE 

1 - ST(GIE) 
Next PC -* PC 
Idle until interrupt. 

None 



24 23 



"H — i — i — i — r 

0 0 110 0 



16 15 



87 



i i i — i — i — i — i — i — i — i — i — i — i — i — i — i — i i i — i — i — r~ 

0000000 00000 00000000001 



The global interrupt enable bit is set, the next PC value is loaded into the PC, 
and the CPU idles until an interrupt is received. When the interrupt is received, 
the contents of the PC are pushed onto the active system stack. 



1 

LUF 

LV 

UF 

N 

Z 

V 

c 



Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 



OVM Operation is not affected by OVM bit value. 

IDLE ; The processor idles until a reset 

; or unmasked interrupt occurs. 
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IDLE2 Low-Power Idle 



Syntax 
Operation 

Operands 
Encoding 



31 



1 — I — 
0 0 0 



Description 



IDLE2 (TMS320LC31 Only) 

1 — ST(GIE) 
Next PC - PC 
Idle until interrupt. 

None 



24 23 



— i — i — i — i — r 

0 0 110 0 



16 15 



87 



i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i— 
ooooooo ooooo 00000000001 



The IDLE2 instruction serves the same function as IDLE, except that it re- 
moves the functional clock input from the internal device. This allows for ex- 
tremely low power mode. The PC is incremented once, and the device remains 
in an idle state until one of the external interrupts (INTO-3) is asserted. 

In IDLE2 mode, the 'C31 will behave as follows: 

□ The CPU, peripherals, and memory will retain their previous states. 

□ When the device is in the functional (nonemulation) mode, the clocks will 
stop with H1 high and H3 low. 

□ The 'LC31 w ill remain in IDLE2 until one of the four external interrupts 
(INT3-INT0) is asserted for at least two H1 cycles. When one of the four 
interrupts is asserted, the clocks start after a delay of one H1 cycle. The 
clocks can start up in the phase opposite that in which they were stopped 
(that is, H1 might start high when H3 was high before stopping, and H3 
might start high when H1 was high before stopping.) However, the H1 and 
H3 clocks remain 180° out of phase with each other. 

□ During IDLE2 operation, for one of the four external interrupts to be recog- 
nized by the CPU and serviced, it must be asserted for at least two H1 
cycles. For the processor to recognize only one interrupt when it restarts 
operation, the interrupt must be asserted for less than three cycles. 

□ When the 'LC31 is in emulation mode, the H1 and H3 clocks will continue 
to run normally, and the CPU will operate as if an IDLE instruction had been 
executed. The clocks continue to run for correct operation of the emulator. 



Delayed Branch 

For correct device operation, the three instructions after a delayed 
branch should not be IDLE or IDLE2 instructions. 



10-84 



Low-Power Idle IDLE2 



Cycles 
Status Bits 



Mode Bit 
Example 



1 




LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



OVM Operation is not affected by OVM bit value. 

IDLE 2 ; The processor idles until a reset 
; or unmasked interrupt occurs. 



Assembly Language Instructions 1 0-85 



LDE Load Floating-Point Exponent 



Syntax 

Operation 

Operands 



Encoding 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



LDE src, dst 
src(exp) -* cfef(exp) 

src general addressing modes (G): 

0 0 register (Rn, 0 * n * 7) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0$n$7) 



31 


24 23 


16 15 


87 


0 


■t r 

0 0 0 


1 1 1 1 1 

0 0 110 1 


1 

G 


1 1 1 1 

dst 


l I I I l l l I l I I 1 1 I I 

src 



The exponent field of the src operand is loaded into the exponent field of the 
dst register. No modification of the dst register mantissa field is made unless 
the value of the exponent loaded is the reserved value of the exponent for 0 
as determined by the precision of the src operand. Then the mantissa field of 
the dst register is set to 0. The src and dst operands are assumed to be float- 
ing-point numbers. Immediate values are evaluated in the short floating-point 
format. 

1 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



OVM Operation is not affected by OVM bit value. 

LDE R0,R5 

Before Instruction: 

R0 = 0200056F30h « 4.00066337e + 00 
R5 = 0A056FE332h = 1.06749648e + 03 
LUF LV UF N Z V C = 0 0 0 0 0 



0 0 



After Instruction: 

R0 = 0200056F30h = 4.00066337e + 00 
R5 = 02056FE332h = 4.1 699081 4e + 00 
LUF LV UF N Z V C = 0 0 0 0 0 



0 0 



10-86 



Load Floating-Point LDF 



Syntax LDF src, dst 

Operation src dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 s n * 7) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0sns7) 

Encoding 



31 


24 23 




16 15 


87 


0 


0 0 0 


1 1 1 1 1 

0 0 1110 


" 1" " 

Q 


I l l l 

dst 


l I I I I l I I I l l l l I l 

src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The src operand is loaded into the dst register. The dst and src operands are 
assumed to be floating-point numbers. 

1 

These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 
Unaffected 
0 

1 if a negative result is generated; 0 otherwise 
1 if a 0 result is generated; 0 otherwise 
0 

Unaffected 



LV 
UF 
N 
Z 
V 

c 

OVM 

LDF 



Operation is not affected by OVM bit value. 
@9800h,R2 
Before Instruction: 



DP = 80h 
R2 = 0h 

Data at 809800h = 10C52A00h = 2.19254303e + 00 
LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

DP = 80h 

R2 = 01 0C52A00h = 2.1 9254303e + 00 

Data at 809800h = 10C52A00h = 2.19254303e + 00 

LUF LV UF N Z V C = 0 0 0 0 0 00 



Assembly Language Instructions 1 0-87 



LDFcond Load Floating-Point Condition al^ 



Syntax 
Operation 



Operands 



LDFcond src, dst 

If cond\s true: 
src dst 

Else: 
dst\s unchanged. 

src general addressing modes (G): 

0 0 register (Rn, 0 * n * 7) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0 * n s; 7) 



Encoding 



31 


24 23 


16 15 


87 


0 


— 1 — 1 — 1 — 

0 10 0 


l l l l 

cond 


i 

G 


I I I I 

dst 


■ i i i i i ■ i i i i i ■ i • 

src 



Description If the condition is true, the src operand is loaded into the dst register, otherwise, 

the dst register is unchanged. The dst and src operands are assumed to be 
floating-point numbers. 

The TMS320C3x provides 20 condition codes that can be used with this in- 
struction (see Table 10-9 on page 10-13 for a list of condition mnemonics, 
condition codes, and flags). Note that an LDFU (load floating-point uncondi- 
tionally) instruction is useful for loading R7-R0 without affecting condition 
flags. Condition flags are set on a previous instruction only when the destina- 
tion register is one of the extended-precision registers (R7-R0) or when one 
of the compare instructions (CMPF, CMPF3, CMPI, CMPI3, TSTB, orTSTB3) 
is executed. 



Cycles 
Status Bits 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 



10-88 



Load Floating-Point Conditionally LDFcond 



Example ldfz r3,rs 

Before Instruction: 

R3 = 2CFF2CD500h = 1.77055560e +13 
R5 = 5F0000003Eh = 3.961 40824e + 28 
LUF LV UF N Z V C = 0 0 0 0 1 0 0 

After Instruction: 

R3 = 2CFF2CD500h = 1.77055560e +13 
R5 = 2CFF2CD500h = 1.77055560e +13 
LUF LV UF N Z V C = 0 0 0 0 1 0 0 



Assembly Language Instructions 1 0-89 



LDF| Load Floating-Point, Interlocked 



Syntax 
Operation 

Operands 



Encoding 



31 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



LDFI src, dst 

Signal interlocked operation 
src dst 

src general addressing modes (G): 
0 1 direct 
1 0 indirect 

dst register (Rn, 0 * n * 7) 



24 23 



16 15 



87 





"■ T"T— i — r™i — 


— r 


— i — i — i — r 


— i — i — i — i — i — i — i — rn — i— 


i — i — i — i — r— 


0 0 0 


0 0 1111 


G 


dst 


src 





The src operand is loaded into the dst register. An interlocked operation is sig- 
naled over XF0 and XF1 . The src and dst operands are assumed to be floating- 
point numbers. Note that only direct and indirect modes are allowed. Refer to 
Section 6.4 on page 6-12 for detailed description. 

1 if XF1 = 0 (See Section 6.4 on page 6-12) 

These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 
LV Unaffected 
UF 0 

N 1 if a negative result is generated; 0 otherwise 
Z 1 if a 0 result is generated; 0 otherwise 
V 0 
C Unaffected 

OVM Operation is not affected by OVM bit value. 

LDFI *+AR2,R7 

Before Instruction: 

AR2 = 8098F1h 
R7 = 0h 

Data at 8098F2h = 584C000h = - 6.281 25e + 01 
LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR2 = 8098F1h 

R7 = 0584C00000h = - 6.281 25e + 01 

Data at 8098F2h = 584C000h = - 6.281 25e + 01 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 



10-90 



Parallel LDF and LDF LDFHLDF 



Syntax LDF src2, dst2 

|| LDF srrt, dst1 

Operation $rc2 dst2 

|| src1-+dst1 

Operands srd indirect (disp = 0, 1 , IRO, IR1) 

dst 1 register (Rn1 , 0 s n1 * 7) 
src2 indirect (disp = 0, 1 , IRO, IR1) 
dst2 register (Rn2, 0 * n2 * 7) 

Encoding 



31 




24 23 




16 15 




87 




0 


'T 

1 1 


r — i t — i — 

0 0 0 1 0 


i i 

dst2 


— 1 — 1 — 

dsn 


I i 

0 0 0 


1 1 1 1 1 1 1 

srd 




i — i — i — r— 

src2 


i i 



Description 

Cycles 
Status Bits 



Mode Bit 



Two floating-point loads are performed in parallel. If the LDFs load the same 
register, the assembler issues a warning. The result is that of LDF src2, dst2. 

1 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



OVM Operation is not affected by OVM bit value. 



Assembly Language Instructions 1 0-91 



LDF1ILDF Parallel LDF and LDF 



Example ldf * — ari ( iro ) , R7 

| | LDF *AR7++(1),R3 

Before Instruction: 

AR1 = 80985Fh 
IRO = 8h 
R7 = 0h 

AR7 = 80988Ah 
R3 = 0h 

Data at 809857h = 70C8000h = 1 .4050e + 02 
Data at 80988Ah = 57B4000h = 6.281 250e + 01 
LUF LV UF N Z V C = 0 0 0 0 0 0 0 

After Instruction: 

AR1 = 809857h 
R0 = 8h 

R7 = 070C800000h = 1 .4050e + 02 
AR7 = 80988Bh 

R3 = 057B400000h = 6.281 250e + 01 
Data at 809857h = 70C8000h = 1 .4050e + 02 
Data at 80988Ah - 57B4000h = 6.281 250e + 01 
LUF LV UF N Z V C = 0 0 0 0 0 00 

i 1 

Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 



10-92 



Parallel LDF and STF LDFHSTF 



Syntax 

Operation 

Operands 

Encoding 



Description 

Cycles 
Status Bits 



LDF src2, dst1 
|| STF src3, dst2 

src2 -* dst1 
|| src3-»dst2 

src2 indirect (disp = 0,1, IRO, IR1) 
dst1 register (Rn1 , 0 * n1 as 7) 
src3 register (Rn2, 0 s; n2 * 7) 
d$t2 indirect (disp = 0, 1, IRO, IR1) 



31 




24 23 




16 15 




87 




0 


1 

1 1 


Till 

0 110 0 


1 1 

dsn 


i i 

0 0 0 


1 I 

srcQ 


i i i i i i i 

dst2 


I I I I I I I 

src2 



A floating-point load and a floating-point store are performed in parallel. 
If src2 and dst2 point to the same location, src2\s read before the write to dst2. 



1 




LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 



Assembly Language Instructions 1 0-93 



LDF| |STF Parallel LDF and STF 



Example ldf *ar2 — ( l ) , Rl 

| | STF R3,*AR4++(IR1) 

Before Instruction: 

AR2 = 8098E7h 
R1 =0h 

R3 = 057B400000h = 6.281 25e + 01 
AR4 = 809900h 
IR1 = 10h 

Data at 8098E7h = 70C8000h = 1 .4050e + 02 
Data at 809900h = Oh 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 

AR2 = 8098E6h 

R1 = 070C800000h = 1 .4050e + 02 
R3 = 057B400000h = 6.281 25e + 01 
AR4 = 80991 Oh 
IR1 = 10h 

Data at 8098E7h = 70C8000h = 1 .4050e + 02 
Data at 809900h = 57B4000h = 6.281 25e + 01 
LUF LV UF N Z V C = 0 0 0 0 0 00 



Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

* ■ 



10-94 



Load Integer LDI 



Syntax LDI src, dst 

Operation src dst 

Operands src general addressing modes (G): 

0 0 any CPU register 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst any CPU register 

Encoding 



31 




24 23 




16 15 


87 


0 


™"T" T— 
0 0 0 


— r i 

0 1 


r ""T" 1 1'™" 
0 0 0 0 


F " 

Q 


I I I l 

dst 


i i i i i i i i i i i i i i i 

src 



Description The src operand is loaded into the dst register. The dst and src operands are 

assumed to be signed integers. An alternate form of LDI, LDP, is used to load 
the data page pointer register (DP). See the LDP instruction and subsec- 
tion 10.3.2 on page 10-16. 



Cycles 
Status Bits 



Mode Bit 
Example 



1 



These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV Unaffected 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 0 

C Unaffected 

OVM Operation is not affected by OVM bit value. 

LDI *-ARl(IR0) ,R5 



Before Instruction: 



AR1 = 2Ch 
IR0 = 5h 

R5 = 3C5h = 965 
Data at 27h = 26h = 38 

LUF LV UF N Z V C = 0 0 0 0 0 00 



Assembly Language Instructions 1 0-95 



LDI Load Integer 



After Instruction: 

AR1 = 2Ch 

IRO = 5h 

R5 = 26h = 38 

Data at 27h = 26h = 38 

LUF LV UF N Z V C = 0 0 0 0 0 00 



10-96 



Load Integer Conditionally L DICOnd 



Syntax 
Operation 



Operands 



Encoding 



LDlcond src, dst 

If cond is true: 
src -* dst t 

Else: 
dst is unchanged. 

src general addressing modes (G): 

0 0 any CPU register 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst any CPU register 



31 


24 23 


16 15 


87 


0 


0 10 1 


1 1 1 1 

cond 


i 

G 


1 1 1 1 

dst 


i i i i i i i i i i i i i i i 

src 



Description If the condition is true, the src operand is loaded into the dst register, otherwise, 

the dst register is unchanged. Regardless of the condition, the read of the src 
takes place. The dst and src operands are assumed to be signed integers. 

The TMS320C3x provides 20 condition codes that can be used with this in- 
struction (see Table 10-9 on page 10-13 for a list of condition mnemonics, 
condition codes, and flags). Note that an LDIU (load integer unconditionally) 
instruction is useful for loading R7-R0 without affecting the condition flags. 
Condition flags are set on a previous instruction only when the destination reg- 
ister is one of the extended-precision registers (R7-R0) or when one of the 
compare instructions (CMPF, CMPF3, CMPI, CMPI3, TSTB, or TSTB3) is ex- 
ecuted. 



Cycles 
Status Bits 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 



Assembly Language Instructions 1 0-97 



LDICOnd Load Integer Conditionally 



Example ldiz *aro++,r6 

Before Instruction: 

ARO = 8098FO 

Data at 8098FOh = 027Ch = 636 
R6 = 0FE2h = 4,066 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 

ARO = 8098F1h 

Data at 8098FOh = 027Ch = 636 

R6 = 0FE2h = 4,066 

LUF LV UF N Z V C = 0 0 0 0 0 00 



Note: Auxiliary Register Arithmetic 

The test condition does not affect the auxiliary register arithmetic. (AR 
modification will always occur.) 



10-98 



Load Integ er, Interlocked LDII 



Syntax 
Operation 

Operands 



Encoding 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



LDII src, dst 

Signal interlocked operation 
src dst 

src general addressing modes (G): 
0 1 direct 
1 0 indirect 

dst any CPU register 



31 




24 23 


16 15 


87 


0 


0 0 0 


— 1 — 1 

0 1 


I I l 

0 0 0 1 


1 

Q 


I I I I 

dst 


i i i i i i i i i i i i i i i 

src 



The src operand is loaded into the dst register. An interlocked operation is sig- 
naled over XFO and XF1 . The src and dst operands are assumed to be signed 
integers. Note that only the direct and indirect modes are allowed. Refer to 
Section 6.4 on page 6-12 for detailed description. 

1 if XF = 0 (See Section 6.4 on page 6-12) 

These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 
LV Unaffected 
UF 0 

N 1 if a negative result is generated; 0 otherwise 
Z 1 if a 0 result is generated; 0 otherwise 
V 0 

C Unaffected 

OVM Operation is not affected by OVM bit value. 

LDII e985Fh,R3 

Before Instruction: 

DP = 80 
R3 = Oh 

Data at 80985Fh = ODCh 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

DP = 80 
R3 = ODCH 

Data at 80985Fh = ODCh 

LUF LV UF N Z V C = 0 0 0 0 0 00 



Assembly Language Instructions 1 0-99 



LDIHLDI Parallel LDI and LDI 



Syntax LDI src2, dst2 

|| LDI srd, d$t1 

Operation $rc2 — dst2 

|| srd — dst1 

Operands srd indirect (disp = 0, 1, IRO, IR1) 

dst1 register (Rn1 , 0 * n1 s7) 

src2 indirect (disp = 0,1, IRO, IR1) 

d$t2 register (Rn2, 0 * n2 & 7) 

Encoding 



31 




24 23 




16 15 




87 






0 


1 1 


1 1 1 1 

0 0 0 1 1 


1 1 

dst2 


i i 

dsn 


1 1 

0 0 0 


1 1 1 1 1 1 1 

srd 






t "r-i — r 
srcZ 


— r- 



Description 

Cycles 
Status Bits 



Two integer loads are performed in parallel. A warning is issued by the assem- 
bler if the LDIs load the same register. The result is that of LDI $rc2, d$t2. 

1 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 



10-100 



Parallel LDI and LDI LDI||LDI 



Example ldi *— ari ( l ) , R7 

| | LDI *AR7++(IR0) ,R1 

Before Instruction: 

AR1 = 809826h 
R7 = 0h 

AR7 = 8098C8h 
IR0= 10h 
R1 =0h 

Data at 809825h = OFAh = 250 
Data at 8098C8h = 2EEh = 750 
LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR1 = 809826h 

R7 = OFAh = 250 

AR7 = 8098D8h 

IR0 = 10h 

R1 = 02EEh = 750 

Data at 809825h = OFAh = 250 

Data at 8098C8h = 2EEh - 750 

LUF LV UF N Z V C = 0 0 0 0 0 0 0 



Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 



Assembly Language Instructions 10-101 



LDIHSTI Parallel LDI and STI 



LDI src2, dst1 
|| STI src3, dst2 

src2-+ dst1 
|| src3-+dst2 

src2 indirect (disp = 0, 1 , IRO, IR1) 
dst1 register (Rn1 , 0 s; n1 s7) 
src3 register (Rn2, 0 as n2 a: 7) 
dst2 indirect (disp = 0, 1 , IRO, IR1) 



24 23 16 15 87 0 



1 1 


1 1 I 1 11 

0 110 1 


1 1 

GfSfl 


1 1 1 11 1 

0 0 0 


i r 

src3 


1 1 I I 1 I I 

dst2 


— i — th — i — i — i — r— 

src2 



Description An integer load and an integer store are performed in parallel. If src2 and dst2 

point to the same location, $rc2 is read before the write to dst2. 

Cycles 1 

Status Bits LUF Unaffected 

LV Unaffected 

UF Unaffected 

N Unaffected 

Z Unaffected 

V Unaffected 

C Unaffected 

Mode Bit OVM Operation is not affected by OVM bit value. 



10-102 



Syntax 

Operation 

Operands 

Encoding 

31 



Parallel LDI and STI LDI||STI 



Example ldi *-ari(1),r2 

| | STI R7,*AR5++(IR0) 

Before Instruction: 

AR1 = 8098E7h 
R2 = 0h 
R7 = 35h = 53 
AR5 = 80982Ch 
IRO = 8h 

Data at 8098E6h = ODCh = 220 
Data at 80982Ch = Oh 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 

AR1 - 8098E7h 
R2 = ODCh = 220 
R7 = 35h = 53 
AR5 = 809834h 
IRO = 8h 

Data at 8098E6h = ODCh = 220 

Data at 80982Ch = 35h - 53 

LUF LV UF N Z V C = 0 0 0 0 0 00 



Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 



Assembly Language Instructions 10-103 



LDM Load Floating- Point ^ Mantissa 



Syntax LDM src, dst 

Operation src (man) dst (man) 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 as n * 7) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0 * n * 7) 

Encoding 



31 




24 23 


16 15 


87 


0 


' T T " 
0 0 0 


r i 

0 1 


I I i 

0 0 10 


G 


i I I I 

dst 


i i i i i i i i i i i i i i i 

src 



Description 



Cycles 
Status Bits 



The mantissa field of the src operand is loaded into the mantissa field of the 
dst register. The dst exponent field is not modified. The src and dst operands 
are assumed to be floating-point numbers. If the src operand is from memory, 
the entire memory contents are loaded as the mantissa. If immediate address- 
ing mode is used, bits 1 5-1 2 of the instruction word are forced to 0 by the as- 
sembler. 

1 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



Mode Bit 
Example 



OVM Operation is not affected by OVM bit value. 

LDM 156.75,R2 (156.75 = 071CC00000h) 

Before Instruction: 



R2 = 0h 

LUF LV UF N Z V C 
After Instruction: 



0 0 0 0 0 0 0 



R2 = 001CC00000h = 1 .224609386 + 00 

LUF LV UF N Z V C = 0 0 0 0 0 00 



10-104 



Load Data Page Pointer LDP 



Syntax LDP src, DP 

Operation src -» data page pointer 

Operands src is the 8 MSBs of the absolute 24-bit source address (src). 

The DP" in the operand is optional. 

Encoding 



31 




24 23 


1615 87 




0 


i i 
0 0 0 


f 1 

0 1 


I 1 1 

0 0 0 0 


1 

1 1 


1 1 1 1 

1 0 0 0 0 


1 1 1 1 1 1 1 

00000000 


1 1 1 1 1 1 1 

src 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



This pseudo-op is an alternate form of the LDUI instruction, except that LDP 
is always in the immediate addressing mode. The src operand field contains 
the eight MSBs of the absolute 24-bit src address (essentially, only 
bits 23-16 of src are used). These eight bits are loaded into the eight LSBs 
of the data page pointer. 

The eight LSBs of the pointer are used in direct addressing as a pointer to the 
page of data being addressed. There is a total of 256 pages, each page 64K 
words long. Bits 31 -8 of the pointer are reserved and should be kept set to 0. 

1 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



OVM Operation is not affected by OVM bit value. 

LDP §809900h, DP 
or 

LDP @809900h 



Before Instruction: 



DP = 65h 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 
DP = 80h 

LUF LV UF N Z V C = 0 0 0 0 0 00 



Assembly Language Instructions 1 0-1 05 



LOPOWER Divide Clock by 16 



Syntax 
Operation 
Operands 
Encoding 



31 



1 — |— 

0 0 0 



LOPOWER 

H1/16-+H1 
None 

23 



(TMS320LC31 Only) 



i i i i i 

1 0 0 0 0 1 



~i — n — i — i — i — i — i — i — i — i — i — i — m — i — i — i — i — i — i — i 

0 0 0 0 0 0 0 000000000 0000001 



Description Device continues to execute instructions, but at the reduced rate of the CLKIN 

frequency divided by 16 (that is, in LOPOWER mode, an 'LC31 with a CLKIN 
frequency of 32 MHz will perform in the same way as a 2-MHz 'LC31 , which 
has an instruction cycle time of 1 000 ns). This allows for low-power operation. 

The 'LC31 CPU slows down during the read phase of the LOPOWER instruc- 
tion. To exit the LOPOWER power-down mode, invoke the MAXSPEED 
instruction (opcode = 1080 0000 h). The 'LC31 resumes full-speed operation 
during the read phase of the MAXSPEED instruction. 



Delayed Branch 

Do not run the IDLE2 instruction in the LOPOWER mode. 



Cycles 
Status Bits 



Mode Bit 
Example 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



OVM Operation is not affected by OVM bit value. 

LOPOWER ; The processor slows down operation to 
; l/16th of the Hi clock. 
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Logical Shift LSH 



LSH count, dst 

If count *0: 
cfsf « count-* dst 

Else: 

dsf » |coi;nf | -* dsf 

counf general addressing modes (G): 

0 0 any CPU register 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst any CPU register 



24 23 16 15 87 0 



—1 — 1 — 

0 0 0 


t i t r™i — 

0 10 0 11 


T — 

G 


—i — i — i — r- 

dst 


"7 — 1 — 1 — I — 1 — 1 — 1 — 1 — 1 — 1 — 1 — 1 — 1 — I — |— 

count 



Description The seven least significant bits of the count operand are used to generate the 

two's complement shift count. If the count operand is greater than 0, the dst 
operand is left-shifted by the value of the count operand. Low-order bits shifted 
in are 0-filled, and high-order bits are shifted out through the carry (C) bit. 

Logical left-shift: 
C+-dst*-Q 

If the count operand is less than 0, the dst is right-shifted by the absolute value 
of the count operand. The high-order bits of the dst operand are 0-filled as they 
are shifted to the right. Low-order bits are shifted out through the C bit. 

Logical right-shift: 
0-+dst-+C 

If the count operand is 0, no shift is performed, and the C bit is set to 0. The 
count operand is assumed to be a signed integer, and the dst operand is as- 
sumed to be an unsigned integer. 



Syntax 
Operation 

Operands 



Encoding 

31 
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LSH Logical Shift 



Cycles 1 

Status Bits These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV Unaffected 

UF 0 

N MSB of the output. 

Z 1 if a 0 output is generated; 0 otherwise 

V 0 

C Set to the value of the last bit shifted out. 0 for a shift count of 0. 



Mode Bit 
Example 1 



Example 2 



OVM Operation is not affected by OVM bit value. 

LSH R4,R7 

Before Instruction: 

R4 = 018h = 24 
R7 = 02ACh 

LUF LV UF N Z V C * 0 0 0 0 0 00 

After Instruction: 

R4 = 018h = 24 
R7 > OACOOOOOOh 

LUF LV UF N Z V C = 0 0 0 1 0 1 0 

LSH *-AR5(IRl) ,R5 

Before Instruction: 

AR5 = 809908h 
IRO = 4h 

R5 = 0012C00000h 

Data at 809904h = 0FFFFFFF4h = -12 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR5 = 809908h 
IRO = 4h 

R5 = 000001 2C00h 

Data at 809904h = 0FFFFFFF4h = -12 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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Logic al Shift, 3-Operand LSH3 



Syntax 
Operation 



Operands 



Encoding 



Description 



LSH3 count, src, dst 

If count 25 0: 
src « count -+ dst 

Else: 

src» \count | -* ctef 

src three-operand addressing modes (T): 
0 0 any CPU register 

0 1 indirect (disp = 0, 1, IRO, IR1) 

1 0 any CPU register 

1 1 indirect (disp = 0, 1 , IRO, IR1) 

count three-operand addressing modes (T): 
0 0 any CPU register 

0 1 any CPU register 

1 0 indirect (disp = 0, 1 , IRO, IR1) 
1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, 0 * n * 27) 



31 


24 23 


16 15 




87 




0 


r ■■■r 1 

0 0 1 


l I I l l 

0 0 1 0 0 0 


1 

T 


I I l I 

dst 


i i i i i i i 
src 


i i i i i i I 

count 



The seven least significant bits of the count operand are used to generate the 
two's complement shift count 

If the count operand is greater than 0, a copy of the src operand is left-shifted 
by the value of the count operand, and the result is written to the dst. (The src 
is not changed.) Low-order bits shifted in are 0-filled, and high-order bits are 
shifted out through the C (carry) bit. 

Logical left-shift: 
C *- src «- 0 

If the count operand is less than 0, the src operand is right-shifted by the abso- 
lute value of the count operand. The high-order bits of the dst operand are 0- 
filled as they are shifted to the right. Low-order bits are shifted out through the 
C bit. 

Logical right-shift: 
0 -* src -* C 

If the count operand is 0, no shift is performed, and the C bit is set to 0. The 
count operand is assumed to be a signed integer. The src and dst operands 
are assumed to be unsigned integers. 
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LSH3 Logical Shift, 3-Operand 



Cycles 
Status Bits 



Mode Bit 
Example 1 



Example 2 



1 

These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 
LV Unaffected 
UF 0 

N MSB of the output. 

Z 1 if a 0 output is generated; 0 otherwise 

V 0 

C Set to the value of the last bit shifted out. 0 for a shift count of 0. 
Unaffected if dst is not R7-R0. 

OVM Operation is not affected by OVM bit value. 

LSH3 R4,R7,R2 

Before Instruction: 



R4 = 018h = 24 
R7 = 02ACh 
R2 = 0h 

LUF LV UF N Z V C = 0 0 0 0 



0 0 0 



After Instruction: 

R4 = 018h = 24 

R7 = 02ACh 

R2 = OACOOOOOOh 

LUF LV UF N Z V C = 0 0 0 1 
LSH3 *-AR4(IRl) ,R5,R3 

Before Instruction: 

AR4 = 809908h 
IR1 = 4h 

R5 = 012C00000h 
R3 = 0h 

Data at 809904h = 0FFFFFFF4h = -12 
LUF LV UF N Z V C = 0 0 0 0 



0 1 0 



0 0 0 
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Logical Shift, 3-Operand LSH3 



After Instruction: 

AR4 = 809908h 
IR1 = 4h 

R5 = 012C00000h 
R3 = 000001 2C00h 

Data at 809904h = 0FFFFFFF4h = -12 

LUF LV UF N Z V C = 0 0 0 0 0 00 

i i 
Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

i » 
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LSH3| |STI Parallel LSH3 and STI 



Syntax 



Operation 



Operands 



Encoding 



LSH3 count, src2, d$t1 
|| STI src3, dst2 

If count* 0: 

src2« count-* dstl 
Else: 

$rc2 » \count | -* d$t1 
|| src3-+dst2 

count register (Rn1 , 0 as n1 $7) 
srd indirect (disp = 0, 1 , IRO, IR1) 
dstl register (Rn3, 0 * n3 s 7) 
src2 register (Rn4, 0 * n4 & 7) 
dst2 indirect (disp = 0, 1 , IRO, IR1) 



31 




24 23 




16 15 




87 




0 


1 1 


0 1110 


dsn 


1,1 1 1 i ■ 

count 


i i 

$rc3 


i i i i i i i 

dst2 


I I I I I I I 

src2 



Description The seven least significant bits of the count operand are used to generate the 

two's complement shift count. 

If the count operand is greater than 0, a copy of the src2 operand is left-shifted 
by the value of the count operand, and the result is written to the dstl. (The 
src2 is not changed.) Low-order bits shifted in are 0-filled, and high-order bits 
are shifted out through the C (carry) bit. 

Logical left-shift: 
C «- src2 «- 0 

If the count operand is less than 0, the $rc2 operand is right-shifted by the ab- 
solute value of the count operand. The high-order bits of the dst operand are 
0-filled as they are shifted to the right. Low-order bits are shifted out through 
the C (carry bit). 

Logical right-shift: 
0 src2 C 

If the count operand is 0, no shift is performed, and the carry bit is set to 0. 

The count operand is assumed to be a seven-bit signed integer, and the src2 
and dstl operands are assumed to be unsigned integers. All registers are read 
at the beginning and loaded at the end of the execute cycle. This means that 
if one of the parallel operations (STI) reads from a register and the operation 
being performed in parallel (LSH3) writes to the same register, STI accepts as 
input the contents of the register before it is modified by the LSH3. 
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Parallel LSH3 and STI LSH3| |STI 



If src2 and dst2 point to the same location, src2\s read before the write to dst2. 



Cycles 
Status Bits 



Mode Bit 
Example 1 



1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV Unaffected 

UF 0 

N MSB of the output. 

Z 1 if a 0 output is generated; 0 otherwise 

V 0 

C Set to the value of the last bit shifted out. 0 for a shift count of 0. 

OVM Operation is affected by OVM bit value. 

LSH3 R2,*++AR3(1),R0 
| | STI R4 , *-AR5 

Before Instruction: 



R2= 18h = 24 
AR3 = 8098C2h 
R0 = 0h 

R4 = ODCh = 220 

AR5 = 8098A3h 

Data at 8098C3h = OACh 

Data at 8098A2h = Oh 

LUF LV UF N Z V C = 0 



0 0 0 0 0 0 



After Instruction: 



R2 = 18h = 24 

AR3 = 8098C3h 

R0 = OACOOOOOOh 

R4 = ODCh = 220 

AR5 = 8098A3h 

Data at 8098C3h = OACh 

Data at 8098A2h = ODCh = 220 

LUF LV UF N Z V C = 0 0 0 1 0 1 
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LSH3| |STI Parallel LSH3 and STI 



Example 2 lsh3 r7,*ar2 — (i),R2 

| | STI R0,*+AR0(1) 

Before Instruction: 

R7 = 0FFFFFFF4h = -12 
AR2 = 809863h 
R2 = Oh 

R0 = 12Ch = 300 

ARO = 8098B7h 

Data at 809863h = 2C000000h 

Data at 8098B8h = Oh 

LUFLVUFNZVC = 000 0 0 00 
After Instruction: 

R7 = 0FFFFFFF4h = -12 
AR2 = 809862h 
R2 = 2C000h 
R0 = 12Ch = 300 
ARO = 8098B7h 
Data at 809863h = 2C000000h 
Data at 8098B8h = 12Ch = 300 
LUF LV UF N Z V C = 0 0 0 0 0 00 

i i 
Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

i i 
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Restore Clock to Regular Speed MAXSPEED 



Syntax 
Operation 
Operands 
Encoding 



31 



MAXSPEED 

H1/16-+H1 
None 

23 



ooo 



— l — i — i — i — r 

1 0 0 0 0 1 



16 15 



87 



i — n — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — r— 

00000000000000000000000 



Description 

Cycles 
Status Bits 



Exits LOPOWER power-down mode (invoked by LOPOWER instruction with 
opcode 1 0800001 h). The 'LC31 resumes full-speed operation during the read 
phase of the MAXSPEED instruction. 

1 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



Mode Bit 
Example 



OVM Operation is not affected by OVM bit value. 

MAXSPEED ? The processor resumes full-speed operation. 
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MPYF Multiply Floating Point 



Syntax 

Operation 

Operands 



Encoding 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



MPYF src f dst 
dstx src-* dst 

src general addressing modes (G): 

0 0 register (Rn, 0 as n * 7) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0 * n * 7) 



31 




24 23 


16 15 


87 


0 


— 1 — 1 — 

0 0 0 


1 1 
0 1 


i i i 
0 10 0 


1 

G 


1 1 1 1 

dst 


iiiiii i ii iiiiii 

src 



The product of the dst and src operands is loaded into the dst register. The src 
operand is assumed to be a single-precision floating-point number, and the dst 
operand is an extended-precision floating-point number. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF 1 if a floating-point underflow occurs; unchanged otherwise 

LV 1 if a floating-point overflow occurs; unchanged otherwise 

UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if a floating-point overflow occurs; 0 otherwise 

C Unaffected 

OVM Operation is not affected by OVM bit value. 

MPYF R0,R2 

Before Instruction: 

R0 = 070C800000h = 1 .4050e + 02 

R2 = 034C200000h = 1. 275781 25e + 01 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

R0 = 070C800000h = 1 .4050e + 02 

R2 = 0A600F2000h = 1 .79247266e + 03 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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MuMpfy l^Xf^. 



Syntax 

Operation 

Operands 



Encoding 



MPYF3 src2, srrt, dst 
srd x src2-+ dst 

srd three-operand addressing modes (T): 
0 0 register (Rn1 , 0 * n1 * 7) 

0 1 indirect (disp = 0, 1 , IRO, IR1) 

1 0 register (Rn1 , 0 * n1 * 7) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, 0 * n2 * 7) 

0 1 register (Rn2, 0 ss n2 * 7) 

1 0 indirect (disp = 0, 1 , IRO, IR1) 
1 1 indirect (disp = 0, 1 , IRO, IR1) 

dst register (Rn, 0 * n * 7) 



31 


24 23 




16 15 




87 




0 


T™ 1 

0 0 1 


"1 — 1 — 1 — |— | 

0 0 1 0 0 1 


1 

T 


1 I I I 

dst 


i i i i i i i 

srd 


l l I l l l I 

src2 



Description 

Cycles 
Status Bits 



The product of the srd and src2 operands is loaded into the dst register. The 
srd and src2 operands are assumed to be single-precision floating-point 
numbers, and the cfef operand is an extended-precision floating-point number. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF 1 if a floating-point underflow occurs; unchanged otherwise 

LV 1 if a floating-point overflow occurs; unchanged otherwise 

UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if a floating-point overflow occurs; 0 otherwise 

C Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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MPYF3 Multiply Floating point, 3-Operand 



Example 1 mpyf3 ro,r7,ri 

Before Instruction: 

RO = 057B400000h = 6.281 250e + 01 
R7 = 0733C00000h = 1 .797506 + 02 
R1 = Oh 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 

R0 = 057B400000h = 6.281 250e + 01 

R7 = 0733C00000h = 1 .79750e + 02 

R1 = 0D306A3000h = 1 .12905469e + 04 

LUF LV UF N Z V C = 0 0 0 0 0 00 

Example 2 MPYF3 *+AR2 ( IRO ) , R7 , R2 

or 

MPYF3 R7 , *+AR2 ( IRO ) f R2 

Before Instruction: 

AR2 = 809800h 
IR0 = 12Ah 

R7 = 057B400000h = 6.281 250e + 01 
R2 = Oh 

Data at 80992Ah = 70C8000h = 1 .4050e + 02 
LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR2 = 809800h 
IR0 = 12Ah 

R7 = 057B400000h = 6.281 250e + 01 
R2 = 0D09E4A000h = 8.8251 5625e + 03 
Data at 80992Ah = 70C8000h = 1 .4050e + 02 
LUF LV UF N Z V C = 0 0 0 0 0 00 

i 1 

Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

i , i 
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Parallel MPYF3 andADDF3 MPYF3||ADDF3 



Syntax 
Operation 

Operands 



MPYF3 srcA, srcB, dst1 
ADDF3 srcC, srcD, dst2 



srcA x srcB - 
srcC + srcD - 



dst1 
>dst2 



srcA 
srcB 
srcC 
srcD_ 

dst1 



Any two indirect (disp = 0,1,IR0,IR1) 
Any two register (0 * Rn * 7) 



register (d1): 

0 = R0 

1 =R1 



Encoding 



31 



1 0 



dst2 



srd 
src2 
$rc3 
src4 



register (d2): 

0 = R2 

1 =R3 

register (Rn, 0 ^ n * 7) 

register (Rn, 0 * n * 7) 

indirect (disp = 0, 1 , IRO, IR1) 

indirect (disp = 0, 1 , IRO, IR1) 



P parallel addressing modes (0 * P * 3) 

Operation (P Field) 



00 
01 
10 
11 



src3 x src4, srd + src2 
src3 x srd, $rc4 + $rc2 
srd x src2, $rc3 + src4 
src3 x srtf, src2 + src4 



24 23 



0 0 0 0 



d1 



d2 



I i 



i — r 

src2 



16 15 



T 



87 



src3 



"T — I 

sro4 



Description A floating-point multiplication and a floating-point addition are performed in 

parallel. All registers are read at the beginning and loaded at the end of the 
execute cycle. This means that if one of the parallel operations (MPYF3) reads 
from a register and the operation being performed in parallel (ADDF3) writes 
to the same register, then MPYF3 accepts as input the contents of the register 
before it is modified by the ADDF3. 
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MPYF3HADDF3 



Parallel MPYF3 andADDF3 



Cycles 
Status Bits 



Mode Bit 
Example 



Any combination of addressing modes can be coded for the four possible 
source operands as long as two are coded as indirect and two are register. The 
assignment of the source operands srcA-srcD to the srd -src4 fields 
varies, depending on the combination of addressing modes used, and the P 
field is encoded accordingly. 

If src2 and dst2 point to the same location, $rc2 is read before the write to dst2. 
1 

These condition flags are modified only if the destination register is R7-R0. 

LUF 1 if a floating-point underflow occurs; unchanged otherwise 

LV 1 if a floating-point overflow occurs; unchanged otherwise 

UF 1 if a floating-point underflow occurs; 0 otherwise 

N 0 

Z 0 

V 1 if a floating-point overflow occurs; 0 otherwise 
C Unaffected 

OVM Operation is not affected by OVM bit value. 

MPYF3 *AR5++ ( 1 ) , * — AR1 ( IRO ) , RO 
I I ADDF3 R5,R7,R3 



Before Instruction: 

AR5 = 8098C5h 
AR1 = 8098A8h 
IRO = 4h 
R0 = 0h 

R5 = 0733C00000h = 1 .79750e + 02 
R7 = 070C800000h = 1 .4050e + 02 
R3 = 0h 

Data at 8098C5h = 34C0000h = 1 .2750e + 01 
Data at 8098A4h = 1110000h = 2.265625e + 00 
LUF LV UF N Z V C = 0 0 0 0 0 00 



Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 
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Parallel MPYF3 andADDF3 MPYF3||ADDF3 



After Instruction: 

AR5 = 8098C6h 
AR1 = 8098A4h 
IRO = 4h 

RO = 04671 80000h = 2.888671 88e + 01 

R5 = 0733C00000h = 1.79750e + 02 

R7 = 070C800000h = 1 .4050e + 02 

R3 = 0820200000h = 3.20250e + 02 

Data at 8098C5h = 34C0000h = 1 .2750e + 01 

Data at 8098A4h = 1110000h = 2.265625e + 00 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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MPYF3||STF Parallel MPYF3 and STF 



Syntax MPYF3 src2, srcl, dst 

|| STF src3, dst2 

Operation srd x src2 -* d$t1 

|| src3-+dst2 

Operands srd register (Rn1 , 0 a; n1 as 7) 

$rc2 indirect (disp = 0, 1 , IRO, IR1) 

d$t1 register (Rn3, 0 as n3 as 7) 

$rc3 register (Rn4, 0 a; n4 as 7) 

dst2 indirect (disp = 0, 1 , IRO, IR1) 

Encoding 



31 




24 23 




16 15 




87 




0 


1 1 


fil l 
0 1111 


dst) 


"T "T 
srd 


i I 

src3 


i i i i i i i 

dstZ 


I I I I I I l 

src2 



Description 



Cycles 
Status Bits 



A floating-point multiplication and a floating-point store are performed in paral- 
lel. All registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (MPYF3) writes to a reg- 
ister and the operation being performed in parallel (STF) reads from the same 
register, the STF accepts as input the contents of the register before it is modi- 
fied by the MPYF3. 

If src2 and dst2 point to the same location, $rc2 \s read before the write to d$t2. 
1 

These condition flags are modified only if the destination register is R7-R0. 

LUF 1 if a floating-point underflow occurs; 0 unchanged otherwise 

LV 1 if a floating-point overflow occurs; unchanged otherwise 

UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if a floating-point overflow occurs; 0 otherwise 

C Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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Parallel MPYF3 and STF MPYF3||STF 



Example mpyf3 *-ar2 ( i ) , r7 , ro 

| | STF R3 , *ARO — ( IRO ) 

Before Instruction: 

AR2 = 80982Bh 

R7 = 057B400000h = 6.281 250e + 01 
R0 = 0h 

R3 = 086B280000h = 4.7031 250e + 02 
ARO = 809860h 
IRO = 8h 

Data at 80982Ah = 70C8000h = 1 .4050e + 02 
Data at 809860h = Oh 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 

AR2 = 80982Bh 

R7 = 057B400000h = 6.2812506 + 01 
RO = 0D09E4A000h = 8.8251 5625e + 03 
R3 = 086B280000h = 4.7031 250e + 02 
ARO = 809858h 
IRO = 8h 

Data at 80982Ah = 70C8000h = 1 .4050e + 02 
Data at 809860h = 86B280000h = 4.7031 250e + 02 
LUF LV UF N Z V C = 0 0 0 0 0 00 

i 1 

Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

■ ■ 
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MPYF3|[SUBF3 Parallel MPYF3 and SUBF3 



Syntax 
Operation 



WIPYF3 srcA, srcB, dstl 
SUBF3 srcC, srcD, dst2 



srcA x srcB - 
srcD-srcC- 



dstl 
>dst2 



Operands 



Encoding 



31 



srcA 
srcB 
srcC 
srcD 

dstl 



dst2 



srd 
$rc2 
src3 
src4 



Any two indirect (disp = 0,1 JR0JR1) 
Any two register (0 * Rn * 7) 



register (d1): 

0 = R0 

1 =R1 

register (d2): 

0 = R2 

1 =R3 

register (Rn, 0 * n * 7) 
register (Rn, 0 * n * 7) 
indirect (disp = 0, 1 , IRO, IR1 ) 
indirect (disp = 0, 1 , IRO, IR1) 



P parallel addressing modes (0 * P s: 3) 

Operation (P Field) 



i — 
1 o 



i — i — r~ 

0 0 0 1 



T 



00 
01 
10 

11 



24 23 



src3 x src4, srd - src2 
src3 x srd, src4 - src2 
srd x src2, src3 - src4 
src3 xsrrf, src2 - src4 



16 15 



di 



d2 



i — r 

src2 



"i — i — i — i — r 

src3 



87 



"1 — i — i — i — i — r 
sro4 



Description A floating-point multiplication and a floating-point subtraction are performed 

in parallel. All registers are read at the beginning and loaded at the end of the 
execute cycle. This means that if one of the parallel operations (MPYF3) reads 
from a register and the operation being performed in parallel (SUBF3) writes 
to the same register, MPYF3 accepts as input the contents of the register be- 
fore it is modified by the SUBF3. 
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Parallel MPYF3 and SUBF3 MPYF3HSUBF3 



Cycles 
Status Bits 



Mode Bit 
Example 



Any combination of addressing modes can be coded for the four possible 
source operands as long as two are coded as indirect and two are coded regis- 
ter. The assignment of the source operands srcA-srcD to the srd -src4 
fields varies, depending on the combination of addressing modes used, and 
the P field is encoded accordingly. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF 1 if a floating-point underflow occurs; unchanged otherwise 

LV 1 if a floating-point overflow occurs; unchanged otherwise 

UF 1 if a floating-point underflow occurs; 0 otherwise 

N 0 

Z 0 

V 1 if a floating-point overflow occurs; 0 otherwise 

C Unaffected 

OVM Operation is not affected by OVM bit value. 



MPYF3 
| | SUBF3 

or 

MPYF3 
| | SUBF3 



R5,*++AR7(IR1) ,R0 
R7,*AR3 — (1) ,R2 

*++AR7(IRl), R5,R0 
R7,*AR3 — (1) ,R2 



Before Instruction: 

R5 = 034C000000h = 1 .2750e + 01 
AR7 = 809904h 
IR1 = 8h 
R0 = 0h 

R7 = 0733C00000h = 1.79750e + 02 
AR3 = 8098B2h 
R2 = 0h 

Data at 80990Ch = 1110000h = 2.250e + 00 
Data at 8098B2h = 70C8000h = 1 .4050e + 02 
LUF LV UF N Z V C = 0 0 0 0 0 0 
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MPYF3||SUBF3 Parallel MPYF3 and SUBF3 



After Instruction: 

R5 = 034C000000h = 1 .2750e + 01 
AR7 = 80990Ch 
IR1 = 8h 

R0 = 04671 80000h = 2.888671 88e + 01 
R7 = 0733C00000h = 1 .79750e + 02 
AR3 = 8098B1h 

R2 = 05E3000000h = - 3.92506 + 01 
Data at 80990Ch = 1110000h = 2.250e + 00 
Data at 8098B2h = 70C8000h = 1 .4050e + 02 
LUF LV UF N Z V C = 0 0 0 0 0 00 

i 1 

Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 
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Multiply Integer MPYI 



Syntax 

Operation 

Operands 



Encoding 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



MPYI src t dst 
dst x src -* dst 

src general addressing modes (G): 

0 0 any CPU register 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst any CPU register 



31 




24 23 


16 15 


87 


0 


" T 1 
0 0 0 


1 1 

0 1 


1 1 1 

0 10 1 


Q 


1 1 1 1 

dst 


1 1 I 1 1 I 1 I 1 I 1 I l I I 

src 



The product of the dst and src operands is loaded into the dst register. The src 
and dst operands, when read, are assumed to be 24-bit signed integers. The 
result is assumed to be a 48-bit signed integer. The output to the dst register 
is the 32 least significant bits of the result. 

Integer overflow occurs when any of the most significant 1 6 bits of the 48-bit 
result differs from the most significant bit of the 32-bit output value. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C Unaffected 

OVM Operation is affected by OVM bit value. 

MPYI R1,R5 

Before Instruction: 

R1 = 000033C251h = 3,392,081 
R5 = 000078B600h = 7,910,912 
LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

R1 = 000033C251h = 3,392,081 

R5 = 00E21 D9600h = -501 ,377,536 

LUF LV UF N Z V C = 0 1 0 1 0 1 0 
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MPY)3 rfrnin ^ 



Syntax 

Operation 

Operands 



Encoding 



MPYI3 src2, srrt, dst 
srd x src2-* dst 

$rd three-operand addressing modes (T): 
0 0 any CPU register 

0 1 indirect (disp = 0, 1 , IRO, IR1) 

1 0 any CPU register 

1 1 indirect (disp = 0, 1 , IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 any CPU register 

0 1 any CPU register 

1 0 indirect (disp = 0, 1 , IRO, IR1) 
1 1 indirect (disp = 0,1, IRO, IR1) 

dst register (Rn, 0 * n * 27) 



31 


24 23 




16 15 




87 




0 


0 0 1 


0 0 10 1 0 


T 


I l 1 I 

dst 


I I l I I l I 

srd 


l I l I l l I 

src2 



Description 



Cycles 
Status Bits 



The product of the srd and src2 operands is loaded into the dst register. The 
srd and src2 operands are assumed to be 24-bit signed integers. The result 
is assumed to be a signed 48-bit integer. The output to the dst register is the 
32 least significant bits of the result. 

Integer overflow occurs when any of the most significant 16 bits of the 48-bit 
result differs from the most significant bit of the 32-bit output value. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C Unaffected 



Mode Bit 



OVM Operation is affected by OVM bit value. 
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Multiply Integer, 3-Operand M PYI3 



Example 1 mpyi3 *ar4,*-ar1(1),r2 

Before Instruction: 

AR4 = 809850h 
AR1 = 8098F3h 
R2 = 0h 

Data at 809850h = OADh = 173 
Data at 8098F2h = ODCh = 220 
LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR4 = 809850h 

AR1 = 8098F3h 

R2 = 094ACh = 38,060 

Data at 809850h = OADh = 173 

Data at 8098F2h = ODCh = 220 

LUF LV UF N Z V C = 0 0 0 0 0 00 

Example 2 mpyi3 * — ar4(iroj ,R2,R7 

Before Instruction: 

AR4 = 8099F8h 
IRO = 8h 

R2 = 0C8h = 200 
R7 = 0h 

Data at 8099F0h = 32h = 50 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR4 = 8099F0h 
IRO = 8h 

R2 = 0C8h = 200 

R7 = 0271 Oh = 10,000 

Data at 8099F0h = 32h = 50 

LUF LV UF N Z V C = 0 0 0 0 0 00 



Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 
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MPYI3HADDI3 Parallel MPYI3 andADDI3 



Syntax 
Operation 

Operands 



MPYI3 srcA, srcB, dst1 
ADDI3 srcC, srcD, dst2 

srcA x srcB -* dst1 
srcD + srcC d$t2 



srcA 
srcB 
srcC 
srcD 

dst1 



Any two indirect (disp = 0,1 ,IR0,IR1) 
Any two register (0 * Rn * 7) 



register (d1): 

0 = R0 

1 =R1 



dst2 



srd 
$rc2 
src3 
src4 



register (c/2): 

0 = R2 

1 =R3 

register (Rn, 0 as n * 7) 

register (Rn, 0 * n * 7) 

indirect (disp = 0, 1 , IRO, IR1) 

indirect (disp = 0, 1 , IRO, IR1) 

parallel addressing modes (0 * P * 3) 



Operation (P Field) 

00 
01 
10 
11 



src3 x src4, srd + src2 
src3 x srtf, src4 + src2 
srd x src2 t src3 + src4 
src3 x srd, src2 + src4 



Encoding 



31 



1 0 



I — I — I — 

0 0 10 



24 23 



T 



d1 



d2 



I r 

srd 



i — r 

src2 



16 15 



srcQ 



87 



i — i — i — i — r 

src4 



Description An integer multiplication and an integer addition are performed in parallel. All 

registers are read at the beginning and loaded at the end of the execute cycle. 
This means that if one of the parallel operations (MPYI3) reads from a register 
and the operation being performed in parallel (ADDI3) writes to the same reg- 
ister, then MPYI3 accepts as input the contents of the register before it is modi- 
fied by the ADDI3. 
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Parallel MPYI3 andADDI3 MPYI3| | ADDI3 



Cycles 
Status Bits 



Mode Bit 
Example 



Any combination of addressing modes can be coded for the four possible 
source operands as long as two are coded as indirect and two are coded as 
register. The assignment of the source operands srcA-srcD to the 
$rd - $rc4 fields varies, depending on the combination of addressing modes 
used, and the P field is encoded accordingly. To simplify processing when the 
order is not significant, the assembler may change the order of operands in 
commutative operations. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 0 

Z 0 

V 1 if an integer overflow occurs; 0 otherwise 

C Unaffected 

OVM Operation is affected by OVM bit value. 

MPYI3 R7,R4,R0 
|| ADD 1 3 *-AR3 , *AR5 — (1),R3 

Before Instruction: 

R7=14h = 20 
R4 = 64h = 100 
R0 = 0h 

AR3 = 80981 Fh 
AR5 = 80996Eh 
R3 = 0h 

Data at 80981 Eh = OFFFFFFCBh = - 53 

Data at 80996Eh = 35h = 53 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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MPYI3||ADDI3 Parallel MPYI3 andADDI3 



After Instruction: 

R7 = 14h = 20 
R4 = 64h = 100 
R0 » 07D0h - 2000 
AR3 = 80981 Fh 
AR5 = 80996Dh 
R3 = 0h 

Data at 80981 Eh = OFFFFFFCBh = - 53 

Data at 80996Eh = 35h = 53 

LUF LV UF N Z V C = 0 0 0 0 0 00 



I I 

Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 
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Parallel MPYI3 and STI MPYI3HSTI 



Syntax MPYI3 src2, srrf, dst1 

|| STI src3, dst2 

Operation srd x src2 -* dst 1 

|| src3-+dst2 

Operands srd register (Rn1 , 0 * n1 * 7) 

src2 indirect (disp = 0, 1 , IRO, IR1) 

dst1 register (Rn3, 0 * n3 s; 7) 

src3 register (Rn4, 0 s; n4 * 7) 

dst2 indirect (disp = 0, 1 , IRO, IR1) 

Encoding 



31 




24 23 




16 15 




87 




0 


1 

1 1 


i r ii 

1 0 0 0 0 


dsn 


i ""\ 

srd 


srcQ 


dst2 


I I I I I I I 

src2 



Description 



Cycles 
Status Bits 



An integer multiplication and an integer store are performed in parallel. All reg- 
isters are read at the beginning and loaded at the end of the execute cycle. This 
means that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (MPYI3) writes to the same register, STI 
accepts as input the contents of the register before it is modified by the MPYI3. 

If $rc2 and dst2 point to the same location, src2is read before the write to dst2. 

Integer overflow occurs when any of the most significant 1 6 bits of the 48-bit 
result differ from the most significant bit of the 32-bit output value. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C Unaffected 



Mode Bit 



OVM Operation is affected by OVM bit value. 
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MPY13||STI Parallel MPYI3 and STI 



Example mpyi3 *++arO(1) ,R5,R7 

| | STI R2 , *-AR3 ( 1 ) 

Before Instruction: 

ARO = 80995Ah 
R5 = 32h = 50 
R7 = 0h 

R2 = 0DCh = 220 

AR3 = 80982Fh 

Data at 80995Bh = 0C8h = 200 

Data at 80982Eh = Oh 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 

ARO = 80995Bh 

R5 = 32h = 50 

R7 = 271 Oh = 10000 

R2 = ODCh = 220 

AR3 = 80982Fh 

Data at 80995Bh = 0C8h = 200 

Data at 80982Eh = ODCh = 220 

LUF LV UF N Z V C = 0 0 0 0 0 00 



Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

i i 
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Parallel MPYI3 and SUBI3 MPYI3HSUB13 



Syntax 



Operation 



MPYI3 srcA, srcB, dst1 
SUBI3 srcC, srcD, dst2 

srcA x srcB -* dst1 
srcD-srcC-+ dst2 



Operands 



Encoding 



31 



1 0 



srcA 
srcB 
srcC 
srcD 

dst1 



dst2 



$rd 
src2 
src3 
src4 



Any two indirect (disp = 0,1,IR0,IR1) 
Any two register (0 s Rn * 7) 



register (d1): 

0 = R0 

1 =R1 

register {d2j: 

0 = R2 

1 =R3 

register (Rn, 0 * n * 7) 
register (Rn, 0 s n £ 7) 
indirect (disp = 0, 1, IRO, IR1) 
indirect (disp = 0, 1, IRO, IR1) 



P parallel addressing modes (0 * P s 3) 

Operation (P Field) 



— i — i — r— 

0 0 11 



00 
01 
10 
11 



24 23 



d1 



d2 



i — r 

src\ 



src3 x src4, srd - src2 
src3 x src/, src4 - src2 
srd x src2, src3 - src4 
src3 x srd, src2 - src4 



16 15 



"i — r 

src2 



87 



srcQ 



i i i r 
srcA 



Description An integer multiplication and an integer subtraction are performed in parallel. 

All registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (MPYI3) reads from a 
register and the operation being performed in parallel (SUBI3) writes to the 
same register, MPYI3 accepts as input the contents of the register before it is 
modified by the SUBI3. 
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MPYI3||SU BI3 Parallel MPYI3 and S UBI3 



Cycles 
Status Bits 



Mode Bit 
Example 



Any combination of addressing modes can be coded for the four possible 
source operands as long as two are coded as indirect and two are coded as reg- 
ister. The assignment of the source operands srcA-srcD to the srrf -src4 
fields varies, depending on the combination of addressing modes used, and the 
P field is encoded accordingly. To simplify processing when the order is not sig- 
nificant, the assembler may change the order of operands in commutative op- 
erations. 

Integer overflow occurs when any of the most significant 1 6 bits of the 48-bit 
result differs from the most significant bit of the 32-bit output value. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 1 if an integer underflow occurs; 0 otherwise 

N 0 

Z 0 

V 1 if an integer overflow occurs; 0 otherwise 

C Unaffected 

OVM Operation is affected by OVM bit value. 

MPYI3 R2,*++AR0(1) ,R0 
| | SUBI3 *AR5 — (IR1),R4,R2 

or 

MPYI3 *++AR0 ( 1 ) , R2 , RO 
| | SUB 1 3 *AR5 — (IR1),R4,R2 

Before Instruction: 

R2 = 32h = 50 
ARO = 8098E3h 
R0 = 0h 

AR5 = 8099FCh 

IR1 =0Ch 

R4 = 07D0h = 2000 

Data at 8098E4h = 62h = 98 

Data at 8099FCh = 4B0h = 1200 

LUF LV UF N Z V C = 0 0 0 0 0 0 0 
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Parallel MPYI3 and SUBI3 MPYI3||SUBI3 



After Instruction: 

R2 = 320h = 800 

ARO = 8098E4h 

R0 = 01324h = 4900 

AR5 = 8099F0h 

IR1 = OCh 

R4 = 07D0h = 2000 

Data at 8098E4h = 62h = 98 

Data at 8099FCh = 4B0h = 1200 

LUF LV UF N Z V C > 0 0 0 0 0 00 

i i 
Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

■ ■ 
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N EG B Negative Integer With Borrow 



Syntax NEGB src, dst 

Operation O-src-C-* dst 

Operands src general addressing modes (G): 

0 0 any CPU register 

0 1 direct 

1 0 indirect 
1 1 immediate 

dsfany CPU register 

Encoding 



31 


24 23 




16 15 


87 


0 


""T T "' 
0 0 0 


0 10 110 


1 

G 


llll 

dst 


i i i i i i i i i i i i i i i 

src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The difference of the 0, src t and C operands is loaded into the dst register. The 
dst and src are assumed to be signed integers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C 1 if a borrow occurs; 0 otherwise 

OVM Operation is affected by OVM bit value. 
NEGB R5,R7 

Before Instruction: 



R5 = OFFFFFFCBh = - 53 
R7 = 0h 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 

After Instruction: 

R5 = OFFFFFFCBh = -53 
R7 = 34h = 52 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 
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Syntax 

Operation 

Operands 



Encoding 



Description 

Cycles 
Status Bits 



NEGF src, dst 
0- src-* dst 

src general addressing modes (G): 

0 0 register (Rn, 0 * n * 7) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ss n * 7) 



31 


24 23 




16 15 


87 


0 


1 1 

0 0 0 


1 1 1 1 1 

0 10 111 


1 

G 


I I I I 

dst 


i i i i i i i i i i i i i i i 

src 



The difference of the 0 and src operands is loaded into the dst register. The 
dst and src operands are assumed to be floating-point numbers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF 1 if a floating-point underflow occurs; unchanged otherwise 

LV 1 if a floating-point overflow occurs; unchanged otherwise 

UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if a floating-point overflow occurs; 0 otherwise 

C Unaffected 



Mode Bit 
Example 



OVM Operation is affected by OVM bit value. 

NEGF *++AR3(2) ,R1 

Before Instruction: 

AR3 = 809800h 

R1 = 057B400025h = 6.281 25006e + 01 
Data at 809802h = 70C8000h = 1 .4050e + 02 
LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR3 = 809802h 

R1 = 07F3800000h = -1 .4050e + 02 

Data at 809802h = 70C8000h = 1 .4050e + 02 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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NEGF||STF Parallel NEGF and STF 



Syntax NEGF src2, dst1 

|| STF src3, dst2 

Operation 0 - src2 -* dst1 

|| src3-+dst2 

Operands src2 indirect (disp = 0, 1 , IRO, IR1 ) 

dst 1 register (Rn1 , 0 * n1 * 7) 

src3 register (Rn2, 0 * n2 * 7) 

dst2 indirect (disp = 0,1, IRO, IR1) 

Encoding 



31 




24 23 




16 15 




87 




0 


1 1 


"T " i 1 i r*" 

1 0 0 0 1 


T J— 
dst\ 


"n — r— 

0 0 0 


I l 

src3 


I I l l l l l 

dst2 


1 1 1 1 1 1 1 

$rc2 



Description 



Cycles 
Status Bits 



A floating-point negation and a floating-point store are performed in parallel. 
All registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STF) reads from a reg- 
ister and the operation being performed in parallel (NEGF) writes to the same 
register, STF accepts as input the contents of the register before it is modified 
by the NEGF. 

If src2 and dst2 point to the same location, src2 is read before the write to dst2. 
1 

These condition flags are modified only if the destination register is R7-R0. 

LUF 1 if a floating-point underflow occurs; 0 unchanged otherwise 

LV 1 if a floating-point overflow occurs; unchanged otherwise 

UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if a floating-point overflow occurs; 0 otherwise 

C Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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Parallel NEFG and STF NEGF||STF 



Example negf *ar4 — ( l ) , R7 

| | STF R2,*++AR5(1) 

Before Instruction: 

AR4 = 8098E1h 
R7 = 0h 

R2 = 0733C00000h = 1 .79750e + 02 
AR5 = 809803h 

Data at 8098E1h = 57B400000h = 6.281 250e + 01 
Data at 809804h = Oh 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 

AR4 = 8098E0h 

R7 = 0584C00000h = - 6.281 250e + 01 
R2 = 0733C00000h = 1 .79750e + 02 
AR5 = 809804h 

Data at 8098E1h = 57B4000h = 6.281 250e + 01 
Data at 809804h = 733C000h = 1 .797506 + 02 
LUF LV UF N Z V C = 0 0 0 0 0 00 

i 1 

Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

i i 
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NEGI Negate Integer 



Syntax NEGI src, dst 

Operation 0 - src -* dst 

Operands src general addressing modes (G): 

0 0 any CPU register 

0 1 direct 

1 0 indirect 
1 1 immediate 

cfefany CPU register 

Encoding 



31 




24 23 


16 


15 


87 


0 


T" r - 

0 0 0 


T 1 

0 1 


i i i 

10 0 0 


1 

Q 


"1 '"1 1 1 T 1 
dst 


i i i i i i i i i i i i i i i 

src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The difference of the 0 and src operands is loaded into the dst register. The 
dst and src operands are assumed to be signed integers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C 1 if a borrow occurs; 0 otherwise 

OVM Operation is affected by OVM bit value. 

NEGI 174, R5 (174 - OAEh) 

Before Instruction: 



R5 = ODCh = 220 

LUF LV UF N Z V C 

After Instruction: 

R5 = 0FFFFFF52 = -174 
LUF LV UF N Z V C 



= 0000000 



= 0 0 0 1 0 0 1 
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Parallel NEGI and STI NEGI||STI 



Syntax NEGI src2, dst1 

|| STI src3, dst2 

Operation 0 - src2 -* dst1 

|| src3-+dst2 

Operands src2 indirect (disp = 0, 1, IRO, IR1) 

dst 1 register (Rn1 , 0 as n1 * 7) 

src3 register (Rn2, 0 ss n2 * 7) 

dst2 indirect (disp = 0, 1 , IRO, IR1) 

Encoding 



31 




24 23 




16 15 




87 




0 


i 

1 1 


■™r "i - m — 

10 0 10 


1 1 

dsn 


i i 

0 0 0 


l l 

srcQ 


I I i I I I I 

dsGt 


I I I I I I i 

src2 



Description 



Cycles 
Status Bits 



An integer negation and an integer store are performed in parallel. All registers 
are read at the beginning and loaded at the end of the execute cycle. This 
means that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (NEGI) writes to the same register, then 
STI accepts as input the contents of the register before it is modified by the 
NEGI. 

If src2 and dst2 point to the same location, src2 is read before the write to dst2. 
1 

These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 1 if a negative result is generated; 0 otherwise 
Z 1 if a 0 result is generated; 0 otherwise 
V 1 if an integer overflow occurs; 0 otherwise 
C 1 if a borrow occurs; 0 otherwise 



Mode Bit 



OVM Operation is affected by OVM bit value. 
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NEGIHSTI Parallel NEGI and STI 



Example negi *-ar3,r2 

| | STI R2,*AR1++ 

Before Instruction: 



AR3 = 80982Fh 

R2 = 19h = 25 

AR1 = 8098A5h 

Data at 80982Eh = ODCh = 220 

Data at 8098A5h ■ Oh 

LUF LV UF N Z V C = 0 0 0 0 0 0 0 

After Instruction: 

AR3 = 80982Fh 

R2 = 0FFFFFF24h = - 220 

AR1 = 8098A6h 

Data at 80982Eh = ODCh = 220 

Data at 8098A5h = 19h = 25 

LUF LV UF N Z V C = 0 0 0 1 0 0 1 



Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 
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No Operation NOP 



Syntax 
Operation 

Operands 
Encoding 



31 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 1 



Example 2 



NOP src 

No ALU or multiplier operations. 

ARn is modified if src is specified in indirect mode. 

src general addressing modes (G): 

0 0 register (no operation) 

1 0 indirect (modify ARn, 0 * n * 7) 



24 23 



16 15 



87 





— 1 — 1 — 1 — 1 — 1 — 




t — i — i — r— 


— 1 — I — 1 — 1 — 1 — 1 — 1 — I — 1 1 1 


i i i i 


0 0 0 


0 1 10 0 1 


G 


0 0 0 0 0 


src 





If the src operand is specified in the indirect mode, the specified addressing 
operation is performed, and a dummy memory read occurs. If the src operand 
is omitted, no operation is performed. 

1 

LUF Unaffected 

LV Unaffected 

UF Unaffected 

N Unaffected 

Z Unaffected 

V Unaffected 

C Unaffected 

OVM Operation is not affected by OVM bit value. 

NOP 

Before Instruction: 

PC = 3Ah 
After Instruction: 

PC = 3Bh 

NOP *AR3 — ( 1 ) 

Before Instruction: 

PC = 5h 
AR3 = 809900h 

After Instruction: 

PC = 6h 
AR3 = 8098FFh 
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NORM Normalize 



Syntax NORM src, dst 

Operation norm (src) -* dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 s n * 7) 

0 1 direct 

1 0 indirect 
1 1 immediate 

Encoding 



31 


24 23 




16 15 


87 


0 


0 0 0 


f f i r f 

0 1 10 10 


G 


I I I I 

dst 


i i i i i i i i i i i i i i i 

src 



Description 



Cycles 
Status Bits 



The src operand is assumed to be an unnormalized floating-point number; that 
is, the implied bit is set equal to the sign bit. The dst is set equal to the normal- 
ized src operand with the implied bit removed. The dst operand exponent is 
set to the src operand exponent minus the size of the left-shift necessary to 
normalize the src. The dst operand is assumed to be a normalized floating- 
point number. 

If src (exp) = -1 28 and src (man) = 0, then dst = 0, Z = 1 , and UF = 0. If src (exp) 
= -1 28 and src (man) * 0, then dst = 0, Z = 0, and UF = 1 . For all other cases 
of the src t if a floating-point underflow occurs, then dst (roan) is forced to 0 and 
dst (exp) = -1 28. If src (man) = 0, then dst (man) = 0 and dst (exp) = -1 28. Re- 
fer to Section 4.6 on page 4-18 for more information. 

1 

These condition flags are modified only if the destination register is R7-R0. 
LUF 1 if a floating-point underflow occurs; unchanged otherwise 
Unaffected 

1 if a floating-point underflow occurs; 0 otherwise 
1 if a negative result is generated; 0 otherwise 
1 if a 0 result is generated; 0 otherwise 



LV 
UF 
N 
Z 
V 

c 



Mode Bit 



OVM 



Unaffected 

Operation is not affected by OVM bit value. 
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Normalize NORM 



Example norm ri,r2 

Before Instruction: 

R1 = 0400003AF5h 
R2 = 070C800000h 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 

R1 = 0400003AF5h 

R2 = F26BD40000h = 1 .124516136 - 04 
LUFLVUFNZVC = 000 0 0 00 
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NOT Bitwise Logical-Complement 



Syntax 

Operation 

Operands 



Encoding 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



NOT src, dst 
-src -* dst 

src general addressing modes (G): 

0 0 any CPU register 

0 1 direct 

1 0 indirect 
1 1 immediate 

cfsfany CPU register 



31 


24 23 


16 15 




87 




0 


1 1 


l l l l l 




I l I I 


I I 


1 1 1 


1 — i — n nr— 


t— i—i r-r 




0 0 0 


0 110 11 


Q 


dst 






src 







The bitwise logical-complement of the src operand is loaded into the dst regis- 
ter. The complement is formed by a logical-NOT of each bit of the src operand. 
The dst and src operands are assumed to be unsigned integers. 

1 

These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 
LV Unaffected 
UF 0 

N MSB of the output 

Z 1 if a 0 result is generated; 0 otherwise 

V 0 

C Unaffected 

OVM Operation is affected by OVM bit value. 

NOT 8982Ch,R4 

Before Instruction: 

DP = 80h 
R4 = 0h 

Data at 80982Ch = 5E2Fh 
LUF LV UF N Z V C = 0 



0 0 0 0 0 0 



After Instruction: 

DP = 80h 

R4 = 0FFFFA1 DOh 
Data at 80982Ch = 5E2Fh 
LUF LV UF N Z V C = 0 



0 0 1 0 0 0 
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Parallel NOT and STI NOTHSTI 



Syntax NOT src2, dst1 

|| STI src3, d$t2 

Operation ~src2 -* dst 1 

|| src3-+dst2 

Operands src2 indirect (disp = 0, 1 , IRO, IR1) 

dst1 register (Rn1 , 0 * n1 $7) 

src3 register (Rn2, 0 * n2 s 7) 

dst2 indirect (disp = 0, 1 , IRO, IR1) 

Encoding 



31 




24 23 




16 15 




87 




0 


1 1 


10 0 11 


"n — 

dsn 


— i — i — 

0 0 0 


l l 

srcQ 


i i i i i i i 

dst2 


I I I I I I I 

src2 



Description 



Cycles 
Status Bits 



A bitwise logical-NOT and an integer store are performed in parallel. All regis- 
ters are read at the beginning and loaded at the end of the execute cycle. This 
means that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (NOT) writes to the same register, STI 
accepts as input the contents of the register before it is modified by the NOT. 

If src2 and dst2 point to the same location, src2 \s read before the write to dst2. 
1 

These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 
LV Unaffected 
UF 0 

N MSB of the output 

Z 1 if a 0 result is generated; 0 otherwise 

V 0 

C Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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NOTHSTI Parallel NOT and STI 



Example not *+ar2,r3 

|| STI R7 , * — AR4 ( IR1 ) 

Before Instruction: 

AR2 = 8099CBh 
R3 = 0h 

R7 = ODCh = 220 
AR4 = 809850h 
IR1=10h 

Data at 8099CCh = 0C2Fh 
Data at 809840h - Oh 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR2 = 8099CBh 
R3 = 0FFFFF3D0h 
R7 = ODCh = 220 
AR4 = 809840h 
IR1 = 10h 

Data at 8099CCh = 0C2Fh 

Data at 809840h = ODCh = 220 

LUF LV UF N Z V C = 0 0 0 1 0 0 0 

i i 
Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 
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Bitwise Logical-OR OR 



Syntax 

Operation 

Operands 



Encoding 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



OR src, dst 
dst OR src -* dst 

src general addressing modes (G): 

0 0 any CPU register 

0 1 direct 

1 0 indirect 

1 1 immediate (not sign-extended) 
dsfany CPU register 



31 


24 23 


16 15 


87 


0 


0 0 0 


1 0 0 0 0 0 


1 

Q 


l I l I 

dst 


i i i i i i i i i i i i i i i 

src 



The bitwise logical OR between the src and dst operands is loaded into the dst 
register. The dst and src operands are assumed to be unsigned integers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV Unaffected 

UF 0 

N MSB of the output 

Z 1 if a 0 result is generated; 0 otherwise 

V 0 

C Unaffected 

OVM Operation is not affected by OVM bit value. 

OR *++ARl(IRl) ,R2 

Before Instruction: 

AR1 = 809800h 
IR1 = 4h 

R2 = 012560000h 

Data at 809804h = 2BCDh 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR1 = 809804h 
IR1 = 4h 

R2 = 012562BCDh 

Data at 809804h = 2BCDh 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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0R3 Bitwise Logical-OR, 3-Operand 



Syntax 

Operation 

Operands 



Encoding 



31 



Description 

Cycles 
Status Bits 



OR3 src2, srrf, dst 
srrf OR src2-> cfsf 

srrf three-operand addressing modes (T): 
0 0 register (Rn1 , 0 n1 * 27) 

0 1 indirect (disp = 0, 1 , IRO, IR1) 

1 0 register (Rn1 , 0 s; n1 s; 27) 

1 1 indirect (disp = 0, 1 , IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, 0 * n2 * 27) 

0 1 register (Rn2, 0 s n2 s; 27) 

1 0 indirect (disp = 0, 1 , IRO, IR1) 
1 1 indirect (disp = 0, 1, IRO, IR1) 

cfsf register (Rn, 0 s; n * 27) 



24 23 



16 15 



87 



-1 7" 


— T"-T" r—r r™ 


— 1 — 


"'T rr "i 


— l — r — i — i — i — i — i — 


t — r t — ri — i — r 


0 0 1 


0 0 10 1 1 


T 


dst 


srd 


src2 



The bitwise logical-OR between the srd and src2 operands is loaded into the 
dst register. The srd, src2, and dst operands are assumed to be unsigned in- 
tegers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV Unaffected 

UF 0 

N MSB of the output 

Z 1 if a 0 result is generated; 0 otherwise 

V 0 

C Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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Bitwise Logical-PR, 3-Operand 0R3 



Example 0R3 *++ar1(iri> ,R2,R7 

Before Instruction: 

AR1 = 809800h 
IR1 = 4h 

R2 = 012560000h 
R7 = 0h 

Data at 809804h = 2BCDh 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR1 = 809804h 
IR1 = 4h 

R2 = 012560000h 

R7 = 012562BCDh 

Data at 809804h = 2BCDh 

LUF LV UF N Z V C = 0 0 0 0 0 00 



Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

■ * 
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0R3||ST1 Parallel 0R3 and STI 



Syntax 0R3 src2, srrt, dst1 

|| STI src3, dst2 

Operation srd OR $rc2 -* dst1 

| src3 -* dst2 

Operands srd register (Rn1 , 0 * n1 * 7) 

$rc2 indirect (disp = 0, 1, IRO, IR1) 
dst 1 register (Rn2, 0 * n2 s 7) 
src3 register (Rn3, 0 as n3 * 7) 
cfs«2 indirect (disp = 0, 1, IRO, IR1) 

Encoding 



31 




24 23 




16 15 




87 




0 


1 1 


i i i i 

10 10 0 


cfsrt 


srd 


i i 

srcQ 


i i i i i i i 

dst2 


1 1 1 1 1 1 1 

src2 



Cycles 
Status Bits 



A bitwise logical-OR and an integer store are performed in parallel. All regis- 
ters are read at the beginning and loaded at the end of the execute cycle. This 
means that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (OR3) writes to the same register, then 
STI accepts as input the contents of the register before it is modified by the 
OR3. 

If src2 and dst2 point to the same location, $rc2\s read before the write to dst2. 
1 

These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 
LV Unaffected 
UF 0 

N MSB of the output 

Z 1 if a 0 result is generated; 0 otherwise 

V 0 

C Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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Parallel OR3 and STI 0R3| |STI 



Example 0R3 * ++AR2 , R5 , R2 

| | STI R6,*AR1 — 

Before Instruction: 



AR2 = 809830h 
R5 = 800000h 
R2 = 0h 

R6 = ODCh = 220 
AR1 = 809883h 
Data at 809831 h = 9800h 
Data at 809883h = Oh 

LUFLVUFNZVC = 000 0 0 00 
After Instruction: 

AR2 = 809831 h 

R5 = 800000h 

R2 = 809800h 

R6 = ODCh = 220 

AR1 = 809882h 

Data at 809831 h = 9800 h 

Data at 809883h = ODCh = 220 

LUFLVUFNZVC = 000 0 0 00 

i 1 

Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

i i 
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POP Pop Integer 



Syntax 
Operation 
Operands 
Encoding 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



POP dst 

*SP > dst 

dst register (Rn, 0 * n * 27) 



31 


24 23 




16 15 87 0 


r T'— 

0 0 0 


— t t-t-t"t " 

0 1 110 0 


1 

0 1 


1 1 I 1 

dst 


I I I I I I I 1 I I I I 1 I I 

0000000000000000 



The top of the current system stack is popped and loaded into the dst register 
(32 LSBs). The top of the stack is assumed to be a signed integer. The POP 
is performed with a postdecrement of the stack pointer. The exponent bits of 
an extended precision register (R7-R0) are left unmodified. 

1 

These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 
LV Unaffected 
UF 0 

N 1 if a negative result is generated; 0 otherwise 
Z 1 if a 0 result is generated; 0 otherwise 
V 0 
C Unaffected 

OVM Operation is not affected by OVM bit value. 
POP R3 

Before Instruction: 



SP = 809856h 

R3 = 012DAh = 4,826 

Data at 809856h = FFFF0DA4h = - 62,044 

LUF LV UF N Z V C = 0 0 0 0 0 

After Instruction: 



0 0 



SP = 809855h 

R3 = 0FFFF0DA4h = -62,044 

Data at 809856h = FFFF0DA4h = - 62,044 

LUF LV UF N Z V C = 0 0 0 1 0 0 0 
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Pop Floating Point POPF 



Syntax 
Operation 
Operands 
Encoding 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



POPF dst 

*SP > dst1 

dst register (Rn, 0 * n & 7) 



31 




24 23 


16 15 87 0 


■ r r 

0 0 0 


T"l 

0 1 


1 1 1 

110 1 


i 

0 1 


1 1 1 1 

dst 


I \ I I I I I I I I I i I I I 
0000 000000000000 



The top of the current system stack is popped and loaded into the dst register 
(32 MSBs). The top of the stack is assumed to be a floating-point number. The 
POP is performed with a postdecrement of the stack pointer. The eight LSBs 
of an extended precision register (R7-R0) are 0 filled. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

UF 0 

LV Unaffected 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 0 

C Unaffected 

OVM Operation is not affected by OVM bit value. 

POPF R4 

Before Instruction: 

SP = 80984Ah 

R4 = 025D2E0123h = 6.91186578e + 00 

Data at 80984Ah = 5F2C1302h = 5.32544007e + 28 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

SP = 809849h 

R4 = 5F2C1 30200h = 5.32544007e + 28 

Data at 80984Ah = 5F2C1302h = 5.32544007e + 28 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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PUSH PUSH Integer 



Syntax 
Operation 
Operands 
Encoding 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



PUSH src 
src -* *++SP 

src register (Rn, 0 s n & 27) 



31 




24 23 




16 15 


87 


0 


1 1 1 
0 0 0 


1 1 

0 1 


f-'T 1™ 

1110 


1 

0 1 


l I l l 

src 


i i i i 

0 0 0 0 


II I I I I 1 

0 0 0 0 0 0 0 


11 1 ""1 11 1 1 

0 0 0 0 0 



The contents of the src register (32 LSBs) are pushed on the current system 
stack. The src is assumed to be a signed integer. The PUSH is performed with 
a preincrement of the stack pointer. The integer or mantissa portion of an ex- 
tended precision register (R7-R0) is saved with this instruction. 



1 

LUF 

LV 

UF 

N 

Z 

V 

c 



Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 



OVM Operation is not affected by OVM bit value. 

PUSH R6 

Before Instruction: 

SP = 8098AEh 

R6 = 025C1 28081 h = 633,41 5,688 
Data at 8098AFh = Oh 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 
SP = 8098AFh 

R6 = 025C1 28081 h = 633,415,688 

Data at 8098AFh = 5C1 28081 h = 1,544,716,417 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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PUSH Floatin g Point PUSH F 



Syntax 
Operation 
Operands 
Encoding 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



PUSHF src 

src *++SP 

src register (Rn, 0 * n * 7) 



31 




24 23 




16 15 


87 


0 


T I - 

0 0 0 


1 1 

0 1 


i i i 
1111 


0 1 


1 1 1 1 

src 


I I I I 

0 0 0 0 


— i — i — r t nr 

0 0 0 0 0 


— i™i — i — i — i — r~ 

0 0 0 0 0 0 0 



The contents of the src register (32 MSBs) are pushed on the current system 
stack. The src is assumed to be a floating-point number. The PUSH is per- 
formed with a preincrement of the stack pointer. The eight LSBs of the mantis- 
sa are not saved. (Note the difference in R2 and the value on the stack in the 
example below.) 



1 




LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



OVM Operation is not affected by OVM bit value. 

PUSHF R2 

Before Instruction: 

SP = 809801 h 

R2 = 025C1 28081 h = 6.87725854e + 00 
Data at 809802h = Oh 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 

SP = 809802h 

R2 = 025C1 28081 h = 6.87725854e + 00 

Data at 809802h = 025C1280h = 6.87725830e + 00 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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RETIcond^ 



Syntax 
Operation 



Operands 
Encoding 



RETlcond 

If carta is true: 

*SP *PC 

1 -+ ST (GIE). 

Else, continue. 

None 



31 


24 23 


1615 87 0 


i i i i 

0 1111 


i i i 

0 0 0 0 


1 

0 0 


l I l l 

cond 


i i i i i i i i i i i i i i i 

0000 000000000000 



Description 



Cycles 
Status Bits 



A conditional return is performed. If the condition is true, the top of the stack 
is popped to the PC, and a 1 is written to the global interrupt enable (GIE) bit 
of the status register. This has the effect of enabling all interrupts for which the 
corresponding interrupt enable bit is a 1 . 

The TMS320C3x provides 20 condition codes that can be used with this in- 
struction (see Table 10-9 on page 10-13 for a list of condition mnemonics, 
condition codes, and flags). Condition flags are set on a previous instruction 
only when the destination register is one of the extended-precision registers 
(R7-RQ) or when one of the compare instructions (CMPF, CMPF3, CMPI, 
CMPI3, TSTB, or TSTB3) is executed. 



4 




LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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Return From Interrupt Conditionally RETIcond 



Example retinz 

Before instruction: 

PC = 456h 
SP = 809830h 
ST = 0h 

Data at 809830h = 123h 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

PC = 123h 
SP = 80982Fh 
ST = 2000h 

Data at 809830h = 123h 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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RETScond Return From Subroutine Conditionally 



Syntax 
Operation 

Operands 
Encoding 



RETScond 

If cond is true: 

*SP > PC. 

Else, continue. 

None 



31 




24 23 




16 


15 


87 


0 


■ t r " i 
0 1 1 


1 

1 1 


i i i 

0 0 0 1 


1 

0 0 


i i i i 

cond 


i i i i 

0 0 0 0 


T" T 1 "I" '1 1 1 

0 0 0 0 0 0 0 


II 1 1 

0 0 0 0 0 



Description 



Mode Bit 
Example 



A conditional return is performed. If the condition is true, the top of the stack 
is popped to the PC. 

The TMS320C3x provides 20 condition codes that you can use with this in- 
struction (see Table 1 0-9 on page -1 3 for a list of condition mnemonics, condi- 
tion codes, and flags). Condition flags are set on a previous instruction only 
when the destination register is one of the extended-precision registers (R7- 
R0) or when one of the compare instructions (CMPF, CMPF3, CMPI, CMPI3, 
TSTB, or TSTB3) is executed. 



Cycles 


4 




Status Bits 


LUF 


Unaffected 




LV 


Unaffected 




UF 


Unaffected 




N 


Unaffected 




Z 


Unaffected 




V 


Unaffected 




C 


Unaffected 



OVM Operation is not affected by OVM bit value. 

RETSGE 

Before Instruction: 

PC = 123h 

SP = 80983Ch 

Data at 80983Ch = 456h 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

PC = 456h 
SP = 80983Bh 
Data at 80983Ch = 456h 
LUF LV UF N Z V C = 0 



0 0 0 0 0 0 
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Round £lo^ 



Syntax 

Operation 

Operands 



Encoding 



RND src, dst 
rnd (src) -* dst 

src general addressing modes (G): 

0 0 register (Rn, 0 * n <; 7) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0 * n * 7) 



31 


24 23 




16 15 


87 


0 


' I 1 '1 11 
0 0 0 


1 1 1 1 1 

1 0 0 0 1 0 


r 

G 


I I I I 

dst 


i i i i i i i i i i i i i i i 

src 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



The result of rounding the src operand is loaded into the dst register.The src 
operand is rounded to the nearest single-precision floating-point value. If the 
src operand is exactly half-way between two single-precision values, it is 
rounded to the most positive value. 

1 

These condition flags are modified only if the destination register is R7-R0. 
LUF 1 if a floating-point underflow occurs; unchanged otherwise 
LV 1 if a floating-point overflow occurs; unchanged otherwise 
UF 1 if a floating-point underflow occurs or the src operand is 0; 
0 otherwise 

N 1 if a negative result is generated; 0 otherwise 
Z Unaffected 

V 1 if a floating-point overflow occurs; 0 otherwise 
C Unaffected 

OVM Operation is affected by OVM bit value. 

RND R5,R2 

Before Instruction: 

R5 = 0733C1 6EEFh = 1 .79755599e + 02 
R2 = 0h 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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RND Round Floating Point 



After Instruction: 

R5 = 0733C1 6EEFh = 1 .79755599e + 02 
R2 = 0733C1 6F00h = 1 .797556006 + 02 
LUF LV UF N Z V C = 0 0 0 0 0 00 

i 1 1 • 1 — — i 

Note: BZUF Instruction 

If a BZ instruction is executed immediately following an RND instruction with 
a 0 operand, the branch is not performed because the zero flag is not set. 
To circumvent this problem, execute a BZUF instruction instead of a BZ 
instruction. 

i > — ^ ; 1 
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Rotate Left ROL 



Syntax 
Operation 
Operands 
Encoding 



ROL dst 

dst left-rotated 1 bit -* dst 
dst register (Rn, 0 s n * 27) 



31 


24 23 




16 15 87 0 


— m — 

0 0 0 


— 1 1 1 — 1 — 1 — 

1 0 0 0 1 1 


— 1 — 

1 1 


iiii 
dst 


i i i i ill ii ffif fir 

0000000000000001 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



The contents of the dst operand are left-rotated one bit and loaded into the dst 
register. This is a circular rotation, with the MSB transferred into the LSB. 



Rotate left: 
C <- 



dst-<- 



1 

These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 
LV Unaffected 
UF 0 

N MSB of the output 

Z 1 if a 0 output is generated; 0 otherwise 

V 0 

C Set to the value of the bit rotated out of the high-order bit. Unaffected 
if ctefis not R7-R0. 

OVM Operation is not affected by OVM bit value. 

ROL R3 

Before Instruction: 

R3 = 80025CD4h 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 

R3 = 0004B9A9h 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 
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ROLC Rotate Left Through Carry 



Syntax 
Operation 
Operands 
Encoding 



ROLC dst 

dst left-rotated one bit through carry bit -+ dst 
dst register (Rn, 0 * n * 27) 



1 1 

0 0 0 


10 0 10 0 


1 1 


dst 


1 1 — 1 — 1 — I— T" T — 1 — r-T-T-T — 1 — 1 — T" 

0000000000000001 



Description 



The contents of the dst operand are left-rotated one bit through the carry bit 
and loaded into the dst register. The MSB is rotated to the carry bit at the same 
time the carry bit is transferred to the LSB. 



Rotate left through carry bit: 

C — < — cfef-<- 
— » 



r 



Cycles 
Status Bits 



Mode Bit 
Example 1 



1 

These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 
LV Unaffected 
UF 0 

N MSB of the output 

Z 1 if a 0 output is generated; 0 otherwise 

V 0 

C Set to the value of the bit rotated out of the high-order bit. If dst is not 
R7-R0, then C is shifted into the dst but not changed. 

OVM Operation is not affected by OVM bit value. 

ROLC R3 



Before Instruction: 



R3 = 00000420h 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 
After Instruction: 

R3 = 000000841 h 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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Rotate Left Through Carry ROLC 

Example 2 rolc R3 

Before Instruction: 

R3 = 80004281 h 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 

R3 = 00008502h 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 
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ROR Rotate Right 



Syntax 
Operation 
Operands 
Encoding 



31 



Description 



ROR dst 

dst right-rotated one bit through carry bit -* dst 
dst register (Rn, 0 s n % 27) 



24 23 



16 15 



87 



1 I 






" 1"" I" I ! 


— 1 — 1 — 1 — 1 — 1 — 1 — 1 — 1 — 1 — 1 — \ — 1 — 1 — 1 — T" 


0 0 0 


10 0 10 1 


1 1 


dst 


1111111111111111 



The contents of the dst operand are right-rotated one bit and loaded into the 
dst register. The LSB is rotated into the carry bit and also transferred into the 
MSB. 



Rotate right: 



dst 



-> C 



Cycles 
Status Bits 



Mode Bit 
Example 



1 

These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 
LV Unaffected 
UF 0 

N MSB of the output 

Z 1 if a 0 output is generated; 0 otherwise 

V 0 

C Set to the value of the bit rotated out of the high-order bit. Unaffected 
if c/sf is not R7-R0. 

OVM Operation is not affected by OVM bit value. 
ROR R7 

Before Instruction: 



R7 = 00000421b 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 

R7 = 8000021 Oh 

LUF LV UF N Z V G = 0 0 0 1 0 0 1 
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Rotate Right Through Carry RORC 



Syntax RORC dst 

Operation dst right-rotated one bit through carry bit -* dst 

Operands dst register (Rn, 0 * n * 27) 

Encoding 



31 


24 23 


16 15 87 0 


0 0 0 


I I I I I 
10 0 110 


1 

1 1 


l I I I 

dst 


1 1 1 1 1 1 V I 1 I 1 1 1 1 1 

1111111111111111 



Description The contents of the dst operand are right-rotated one bit through the status 

register's carry bit. This could be viewed as a 33-bit shift. The carry bit value 
is rotated into the MSB of the dst t while at the same time the dsfLSB is rotated 
into the carry bit. 

Rotate right through carry bit: 



* — C — * — dst 

« 




Cycles 
Status Bits 



Mode Bit 
Example 



1 

These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 
LV Unaffected 
UF 0 

N MSB of the output 

Z 1 if a 0 output is generated; 0 otherwise 

V 0 

C Set to the value of the bit rotated out of the high-order bit. If dst is not 
R7 - R0, then C is shifted in but not changed. 

OVM Operation is not affected by OVM bit value. 

RORC R4 



Before Instruction: 



R4 = 80000081 h 

LUF LV UF N Z V C = 0 0 0 1 0 0 0 
After Instruction: 
R4 = 40000040h 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 
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RPTB Repeat Block 



Syntax 
Operation 

Operands 
Encoding 



RPTB src 

src RE 
1 - ST (RM) 
Next PC -> RS 

src long-immediate addressing mode 



31 



t — i — i — r 

0 11 0 0 10 0 



24 23 



Description 



Mode Bit 
Example 



"i — i — r 



i — r 



16 15 



i — i — i — r 

src 



87 



i — r 



RPTB allows a block of instructions to be repeated a number of times without 
any penalty for looping. This instruction activates the block repeat mode of up- 
dating the PC. The src operand is a 24-bit unsigned immediate value that is 
loaded into the repeat end address (RE) register. A 1 is written into the repeat 
mode bit of status register ST (RM) to indicate that the PC is being updated 
in the repeat mode. The address of the next instruction is loaded into the repeat 
start address (RS) register. 



Cycles 


4 




Status Bits 


LUF 


Unaffected 




LV 


Unaffected 




UF 


Unaffected 




N 


Unaffected 




Z 


Unaffected 




V 


Unaffected 




C 


Unaffected 



OVM Operation is not affected by OVM bit value. 

RPTB 12 7h 

Before Instruction: 

PC = 123h 
ST = 0h 
RE = Oh 
RS = 0h 

LUF LV UF N Z 
After Instruction: 

PC = 124h 
ST = 100h 
RE = 127h 
RS = 124h 
LUF LV UF N 



VC = 0000000 



ZVC = 000 0 0 00 
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Repeat Single RPTS 



Syntax 
Operation 



Operands 



Encoding 



Description 



Cycles 
Status Bits 



RPTS src 

src -* RC 
1 ST (RM) 
1 -*S 

Next PC -* RS 
Next PC -> RE 

src general addressing modes (G): 

0 0 register 

0 1 direct 

1 0 indirect 
1 1 immediate 



31 


24 23 


16 15 


87 


0 


" "T T 
0 0 0 


10 0 111 


1 

G 


I 1 I I 

110 11 


I I I I I l l l I I l l l l l 

src 



The RPTS instruction allows you to repeat a single instruction a number of 
times without any penalty for looping. Fetches can also be made from the in- 
struction register (IR), thus avoiding repeated memory access. 

The src operand is loaded into the repeat counter (RC). A 1 is written into the 
repeat mode bit of the status register ST (RM). A 1 is also written into the re- 
peat single bit (S) . This indicates that the program fetches are to be performed 
only from the instruction register. The next PC is loaded into the repeat end 
address (RE) register and the repeat start address (RS) register. 

For the immediate mode, the src operand is assumed to be an unsigned inte- 
ger and is not sign-extended. 



4 




LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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RPTS Repeat Single 



Example rpts AR5 

Before Instruction: 

PC = 123h 
ST = 0h 
RS = 0h 
RE = Oh 
RC = Oh 
AR5 = OFFh 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

PC = 124h 
ST= 100h 
RS = 124h 
RE = 124h 
RC = OFFh 
AR5 = OFFh 

LUF LV UF N Z V C - 0 0 0 0 0 00 
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Signal, Interl ocked SIGI 



Syntax 
Operation 

Operands 
Encoding 



31 



n — r~ 

0 0 0 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



SIGI 

Signal interlocked operation. 
Wait for interlock acknowledge. 
Clear interlock. 

None 



24 23 



i — i — i — r 

10 110 0 



0 0 



16 15 



— i — i — i — i — i — i — i — i — i — r 
oooooooo oo 



87 0 

-m — i — i — i — rnr^THTT i 

0000000000 0 



An interlocked operation is signaled over XF0 and XF1 . After the interlocked 
operation is acknowledged, the interlocked operation ends. SIGI ignores the 
external ready signals. Refer to Section 6.4 on page 6-1 2 for detailed informa- 
tion. 



1 

LUF 

LV 

UF 

N 

Z 

V 

c 



Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 



OVM Operation is not affected by OVM bit value. 

SIGI ; The processor sets XF0 to 0, idles 

; until XF1 is set to 0, and then 
; sets XF0 to 1. 
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STF Store Floating Point 



Syntax STF src, dst 

Operation src -* dst 

Operands src register (Rn, 0sn^7) 



dst general addressing modes (G): 
0 1 direct 
1 0 indirect 



Encoding 



31 


24 23 






16 15 


87 


0 


0 0 0 


" t — r t~ i — r™ 

10 10 0 0 


1 

G 


I 1 I I 

src 


I l I l I I l I l I l l I l I 

dst 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The src register is loaded into the dst memory location. The src and dst oper- 
ands are assumed to be floating-point numbers. 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



OVM Operation is not affected by OVM bit value. 

STF R2,@98Alh 



Before Instruction: 



DP = 80h 

R2 = 052C501900h = 4.30782204e + 01 
Dataat8098A1h = 0h 

LUF LV UF N Z V C = 0 0 0 0 0 00 



After Instruction: 



DP = 80h 

R2 = 052C501900h = 4.30782204e + 01 

Data at 8098A1h = 52C5019h = 4.30782204e + 01 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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Store Float ing Point, ^ /n fe^oc/rec/^ ||( STFI 



Syntax 
Operation 

Operands 



Encoding 



31 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



STFI src t dst 
src -* dst 

Signal end of interlocked operation. 

src register (Rn, 0 * n * 7) 

dst general addressing modes (G): 
0 1 direct 
1 0 indirect 



24 23 



16 15 



87 



1 1 


i i r i i 




r t j— t 


"""I 1 I T T"-T 


i i i i "i i i r i 


0 0 0 


10 10 0 1 


G 


src 




dst 



The src register is loaded into the dst memory location. An interlocked opera- 
tion is signaled over pins XFO and XF1 . The src and dst operands are assumed 
to be floating-point numbers. Refer to Section 6.4 on page 6-12 for detailed 
information. 

1 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



OVM Operation is not affected by OVM bit value. 

STFI R3 , *-AR4 

Before Instruction: 

R3 = 0733C00000h = 1 .79750e + 02 
AR4 = 80993Ch 
Data at 80993Bh = Oh 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 

R3 = 0733C00000h = 1 .79750e + 02 
AR4 = 80993Ch 

Data at 80993Bh = 733C000h = 1 .797506 + 02 
LUF LV UF N Z V C = 0 0 0 0 0 00 
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STFIISTF Parallel Sto re Floating Point 



Syntax 

Operation 

Operands 



Encoding 



STF src2, dst2 
|| STF srd, dst1 

src2-» dst2 
|| srd dst1 

srd register (Rn1 , 0 * n1 * 7) 
dst1 indirect (disp = 0, 1, IRQ, IR1) 
src2 register (Rn2, 0 ss n2 ss 7) 
dst2 indirect (disp = 0, 1 , IRO, IR1) 



31 




24 23 




16 15 




87 




0 


s 

1 1 


I 1 S 1 

0 0 0 0 0 


i i 
src2 


i i 

0 0 0 


i i 

srd 


i i i i i i i 

dsn 


I I I l R I I 

dst2 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



Two STF instructions are executed in parallel. Both srd and src2 are assumed 
to be floating-point numbers. 



1 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



OVM Operation is not affected by OVM bit value. 

STF R4 , *AR3 — 
| | STF R3,*++AR5 

Before Instruction: 

R4 = 070C800000h = 1 .4050e + 02 
AR3 = 809835h 

R3 = 0733C00000h = 1.79750e + 02 
AR5 = 8099D2h 
Data at 809835h = Oh 
Data at 8099D3h = Oh 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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Parallel Store Floating Point STF|jSTF 



After Instruction: 

R4 = 070C800000h = 1 .4050e + 02 
AR3 = 809834h 

R3 = 0733C00000h = 1 .79750e + 02 
AR5 = 8099D3h 

Data at 809835h = 070C8000h = 1 .4050e + 02 
Data at 8099D3h = 0733C000h = 1 .79750e + 02 
LUF LV UF N Z V C = 0 0 0 0 0 00 

i 1 

Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

i i 
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ST1 Store Integer 



Syntax 

Operation 

Operands 



Encoding 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



STI src, dst 
src -* dst 

src register (Rn, 0 as n a: 27) 

dst general addressing modes (G): 
0 1 direct 
1 0 indirect 



31 


24 23 




16 15 




87 






0 




" i "i 111 r j i 




I l l I 


i i 


i i i 




T "T~l 


nrn 


i — r~ 


0 0 0 


10 10 10 


G 


src 






dst 









The src register is loaded into the dst memory location. The src and dst oper- 
ands are assumed to be signed integers. 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



OVM Operation is not affected by OVM bit value. 
STI R4,§982Bh 
Before Instruction: 



DP = 80h 

R4 = 42BD7h = 273,367 

Data at 80982Bh = 0E5FCh = 58,876 

LUF LV UF N Z V C = 0 0 0 0 



0 0 0 



After Instruction: 



DP = 80h 

R4 = 42BD7h * 273,367 

Data at 80982Bh = 42BD7h = 273,367 

LUF LV UF N Z V C = 0 0 0 0 0 



0 0 
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Store Integer, Interlocked STII 



Syntax 
Operation 

Operands 



Encoding 



STII src, dst 
src -+ dst 

Signal end of interlocked operation 

src register (Rn, 0 as n * 27) 

dst general addressing modes (G): 
0 1 direct 
1 0 indirect 



31 


24 23 


16 15 


87 


0 


"'T T ~ 
0 0 0 


1 I 1 1 1 

10 10 11 


1 

G 


I I I 1 

src 


I I I I I I I I I I I I I I I 

dst 



Description The src register is loaded into the dst memory location. An interlocked opera- 

tion is signaled over pins XFO and XF1 . The src and dst operands are assumed 
to be signed integers. Refer to Section 6.4 on page 6-1 2 for detailed informa- 
tion. 



Cycles 
Status Bits 



Mode Bit 
Example 



1 

LUF 

LV 

UF 

N 

Z 

V 

c 



Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 



OVM Operation is not affected by OVM bit value. 

STII Rl,@98AEh 

Before Instruction: 



DP = 80h 
R1 = 78Dh 

Data at 8098AEh = 25Ch 

After Instruction: 

DP = 80h 
R1 = 78Dh 

Data at 8098AEh = 78Dh 
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STI1ISTI Parallel STI and STI 



Syntax 

Operation 

Operands 

Encoding 



31 



STI 
STI 



src2, d$t2 
srd, dst1 



src2 -* dst2 
|| srd -* dst1 

srd register (Rn1 , 0 & n1 * 7) 
dst1 indirect (disp = 0, 1 , IRO, IR1) 
$rc2 register (Rn2, 0 s n2 * 7) 
dst2 indirect (disp * 0, 1 , IRO, IR1) 



24 23 



16 15 



87 





t-t— t-t— 


■n — 


■— j— 1 — 


T'— r- 


'"■ n — r ,lwi r t-tt— 


i i i i i i i 


1 1 


0 0 0 0 1 


srdZ 


0 0 0 


srd 







Description 

Cycles 
Status Bits 



Mode Bit 
Example 



Two integer stores are performed in parallel. If both stores are executed to the 
same address, the value written is that of STI src2, dst2. 

1 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



OVM Operation is not affected by OVM bit value. 

STI R0,*++AR2(IR0) 
j | STI R5,*AR0 



Before Instruction: 



RO = ODCh = 220 
AR2 = 809830h 
IRO = 8h 
R5 = 35h = 53 
ARO = 8098D3h 
Data at 809838h = Oh 
Data at 8098D3h = Oh 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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Parallel STI and STI STI| |STI 



After Instruction: 

RO = ODCh = 220 

AR2 = 809838h 

IRO = 8h 

R5 = 35h = 53 

ARO = 809803h 

Data at 809838h = ODCh = 220 

Data at 8098D3h = 35h = 53 

LUF LV UF N Z V C = 0 0 0 0 0 00 



Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

■ ■ 
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SUBB Subtract Integ er With Borrow 



Syntax 

Operation 

Operands 



Encoding 



SUBB src, dst 
dst - src -C -* dst 

src general addressing modes (G): 

0 0 register (Rn, 0 * n s 27) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0 * n * 27) 



31 




24 23 




16 15 


87 


0 


■ "T "T " 
0 0 0 


1 "1 1 

1 0 1 


"T T " 
1 0 1 


G 


1 I I 1 

dst 


l I I I I I l I I l I l l l I 

src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The difference of the dst, src, and C operands is loaded into the dst register. 
The dst and src operands are assumed to be signed integers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C 1 if a borrow occurs; 0 otherwise 

OVM Operation is affected by OVM bit value. 

SUBB *AR5++(4),R5 

Before Instruction: 



AR5 = 809800h 

R5 = OFAh = 250 

Data at 809800h = 0C7h = 1 99 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 



After Instruction: 



AR5 = 809804h 

R5 = 032h = 50 

Data at 809800h = 0C7h = 1 99 

LUF LV .UF N Z V C » 0 0 0 0 0 0 0 
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Subtract Integer With Borrow, 3-Operand SU BB3 



Syntax 

Operation 

Operands 



Encoding 



31 



Description 

Cycles 
Status Bits 



SUBB3 src2, srrt, dst 

srd - src2- C -* dst 

srd three-operand addressing modes (T): 
0 0 register (Rn1 , 0 s n1 * 27) 

0 1 indirect (disp = 0, 1 , IRO, IR1) 

1 0 register (Rn1 , 0 * n1 s 27) 

1 1 indirect (disp = 0, 1 , IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, 0 * n2 * 27) 

0 1 register (Rn2, 0 s n2 s 27) 

1 0 indirect (disp = 0, 1 , IRO, IR1) 
1 1 indirect (disp = 0, 1 , IRO, IR1) 

dst register (Rn, 0 * n * 27) 



24 23 



16 15 



87 





"i t— m — r— 


— 1 — 


i i i i 


i i i i i i i 


T t — r — i — i — 1 — T" 


0 0 1 


0 0 1 10 0 


T 


dst 


srd 


src2 



Mode Bit 



The difference of the srd and src2 operands and the C flag is loaded into the 
dst register. The srd, src2 t and dst operands are assumed to be signed inte- 
gers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C 1 if a borrow occurs; 0 otherwise 

OVM Operation is affected by OVM bit value. 
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SU BB3 Subtract Integer With Borrow, 3-Operand 



Example subb3 r5,*ars++(iro) ,ro 

Before Instruction: 

AR5 = 809800h 
IRO = 4h 

R5 = 0C7h = 199 
R0 = 0h 

Data at 809800h = OFAh = 250 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 

After Instruction: 

AR5 = 809804h 
IRO = 4h 

R5 = 0C7h = 199 
RO = 32h = 50 

Data at 809800h = OFAh = 250 

LUF LV UF N Z V C = 0 0 0 0 0 00 

i i 
Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

■ ■ 
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Subtract integer Conditionally SUBC 



Syntax 
Operation 



Operands 



Encoding 



31 



SUBC src, dst 

If (dst- src ±0): 

(dst- src« 1) OR 1 dst 

Else: 

dst« 1 -* dst 

src general addressing modes (G): 

0 0 register (Rn, 0 * n * 27) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0 * n as 27) 



24 23 



16 15 



87 



T r ■■ 


"7 1 — 1 — 1 1 


— T— 


— i — i — i — r 


— r— i — i — i — i — i — i — i — i — i — r 


™i — i — i — r~ 


0 0 0 


1 0 1 110 


G 


dst 


stc 





Description 



Cycles 
Status Bits 



The srcoperand is subtracted from the dst operand. The asroperana is loaded 
with a value dependent on the result of the subtraction. If (dst- src) is greater 
than or equal to 0, then (dst- src) is left-shifted one bit, the least significant 
bit is set to 1 , and the result is loaded into the dst register. If (ast - src) is less 
than 0, dst is left-shifted one bit and loaded into the dst register. The dst and 
src operands are assumed to be unsigned integers. 

You can use SUBC to perform a single step of a multibit integer division. See 
subsection 1 1 .3.4 on page 1 1 -26 for a detailed description. 

1 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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SU BC Subtract Integer Conditionally 



Example 1 subc 698C5h,Ri 

Before Instruction: 
DP = 80h 

R1 =04F6h = 1270 

Data at 8098C5h = 492h = 1170 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

DP = 80h 

R1 = 0C9h = 201 

Data at 8098C5h = 492h = 1170 

LUF LV UF N Z V C = 0 0 0 0 0 00 

Example 2 SUBC 3000, R0 (3000 - 0BB8h) 

Before Instruction: 

RO = 07D0h = 2000 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 
RO = OFAOh = 4000 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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Syntax 

Operation 

Operands 



Encoding 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



SUBF src, dst 
dst -src-* dst 

src general addressing modes (G): 

0 0 register (Rn, 0 * n * 7) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0 s: n * 7) 



31 


24 23 




16 15 


87 


0 


1 1 

0 0 0 


i r r r™ r 

10 1111 


i 

G 


i i i i 

dst 


i i i i i i i i i i i i i i i 

src 



The difference of the dst operand minus the src operand is loaded into the 
dst register. The dst and src operands are assumed to be floating-point num- 
bers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF 1 if a floating-point underflow occurs; unchanged otherwise 

LV 1 if a floating-point overflow occurs; unchanged otherwise 

UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if a floating-point overflow occurs; 0 otherwise 

C Unaffected 

OVM Operation is not affected by OVM bit value. 

SUBF *AR0 — (IR0),R5 

Before Instruction: 

ARO = 809888h 
IRO = 80h 

R5 = 0733C00000h = 1 .79750000e + 02 
Data at 809888h = 70C8000h = 1 .4050e + 02 
LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

ARO = 809808h 
IRO = 80h 

R5 = 051 DOOOOOOh = 3.9250e + 01 

Data at 809888h = 70C8000h = 1 .4050e + 02 

LUF LV UF N Z V C = 0 0 0 0 0 00 



Assembly Language Instructions 1 0- 1 87 



SU I BF3 Subtra^H Point, S-Operand 



Syntax 

Operation 

Operands 



Encoding 



SUBF3 src2, srd, dst 
srd - src2-+ dst 

srd three-operand addressing modes (T): 
0 0 register (Rn1, * n1 ss7) 

0 1 indirect (disp = 0, 1 , IRO, IR1) 

1 0 register (Rn1 , * n1 ss 7) 

1 1 indirect (disp = 0,1, IRO, !R1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, * n2 ss 7) 

0 1 register (Rn2, * n2 * 7) 

1 0 indirect (disp = 0, 1 , IRO, IR1) 
1 1 indirect (disp = 0, 1 , IRO, IR1) 

dst register (Rn, 0 * n & 7) 



31 


24 23 




16 15 




87 




0 


0 0 1 


0 0 1 10 1 


1 

T 


i i i i 

dst 


i i i i i i i 

srd 


I l l I l I I 

src2 



Description 

Cycles 
Status Bits 



The difference of the srd and src2 operands is loaded into the dst register. 
The srd, src2 t and dst operands are assumed to be floating-point numbers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF 1 if a floating-point underflow occurs; unchanged otherwise 

LV 1 if a floating-point overflow occurs; unchanged otherwise 

UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if a floating-point overflow occurs; 0 otherwise 

C Unaffected 



Mom Bit 



OVM Operation is not affected by OVM bit value. 
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Subtract Floating Point, 3-Operand SUBF 3 



Example 1 



SUBF3 *ARO — (IRO) ,*AR1,R4 



Before Instruction: 

ARO = 809888h 
IRO = 80h 
AR1 = 809851 h 
R4 = 0h 

Data at 809888h = 70C8000h = 1 .4050e + 02 
Data at 809851 h = 733C000h = 1 .79750e + 02 
LUFLVUFNZVC = 000 0 0 00 

After Instruction: 

ARO = 809808h 
IRO = 80h 
AR1 = 809851 h 

R4 = 51 DOOOOOOh = 3.92506 + 01 
Data at 809888h = 70C8000h = 1 .4050e + 02 
Data at 809851 h = 733C000h = 1 .79750e + 02 
LUF LV UF N Z V C = 0 0 0 0 0 00 



Before Instruction: 

R7 = 57B400000h = 6.281 250e + 01 
RO = 34C200000h = 1 .275781 25e + 01 
R6 = 0h 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After instruction: 

R7 = 57B400000h = 6.281 250e + 01 

RO = 34C200000h = 1. 275781 25e + 01 

R6 = 5B7C80000h = -5.00546875e + 01 

LUF LV UF N Z V C = 0 0 0 0 1 0 0 

i 1 

Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 



Example 2 



SUBF3 R7,R0,R6 



Assembly Language Instructions 
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SUBF3|1S TF Parallel SU BF3 and STF 



Syntax 

Operation 

Operands 



Encoding 



31 



T — 
1 1 



SUBF3 src1,src2,dst1 
STF src3, dst2 

src2- srd -* dst1 
src3 -+ dst2 



srd 
src2 
dst1 
src3 
dst2 



register (Rn1, 0 * n1 * 7) 
indirect (disp = 0, 1, IRO, IR1) 
register (Rn2, 0 * n2 s 7) 
register (Rn3, 0 s n3 s 7) 
indirect (disp = 0, 1, IRO, IR1) 



— , — , — p- 

0 10 1 



24 23 



1 — r 

dsn 



16 15 



i r 

srd 



src3 



i — i — i — r 

dst2 



87 



t — i — i — r 

src2 



Description 



Cycles 
Status Bits 



A floating-point subtraction and a floating-point store are performed in parallel. 
All registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STF) reads from a reg- 
ister and the operation being performed in parallel (SUBF3) writes to the same 
register, STF accepts as input the contents of the register before it is modified 
by the SUBF3. 

If src3 and dst1 point to the same location, src3 is read before the write to dsth 
1 

These condition flags are modified only if the destination register is R7-R0. 

LUF 1 if a floating-point underflow occurs; unchanged otherwise 

LV 1 if a floating-point overflow occurs; unchanged otherwise 

UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if a floating-point overflow occurs; 0 otherwise 

C Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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Parallel SUBF3 and STF SUBF3||STF 



Example subf3 ri , *-ar4 ( iri ) , ro 

| | STF R7,*+AR5(IR0) 

Before Instruction: 

R1 = 057B400000h = 6.281 25e + 01 
AR4 = 8098B8h 
IR1 = 8h 
R0 = 0h 

R7 = 0733C00000h = 1 .797506 + 02 
AR5 = 809850h 
IR0 = 10h 

Data at 8098B0h = 70C8000h = 1 .4050e + 02 
Data at 809860h = Oh 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 



R1 = 057B400000h = 6.281 25e + 01 
AR4 = 8098B8H 
IR1 = 8h 

R0 = 061 B600000h = 7.768750e + 01 
R7 = 0733C00000h = 1 .79750e + 02 
AR5 = 809850h 
IR0 = 10h 

Data at 8098B0h = 70C8000h = 1 .4050e + 02 
Data at 809860h = 733C000h = 1 .79750a + 02 
LUF LV UF N Z V C = 0 0 0 0 0 00 

i 1 

Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 
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SUBI -frr Su^ 



Syntax 

Operation 

Operands 



Encoding 



31 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



SUBI src, dst 
dst- src-* dst 

src general addressing modes (G): 

0 0 register (Rn, 0 * n s 27) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0 s n js 27) 



24 23 



16 15 



87 





i i i r T 


i 


1 I I 1 


1 r i r t "i -"i i— i r™ 


I I ii r t 


0 0 0 


1 1 0 0 0 0 


G 


dst 


src 





The difference of the dst operand minus the src operand is loaded into the dst 
register. The dst and src operands are assumed to be signed integers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C 1 if a borrow occurs; 0 otherwise 

OVM Operation is affected by OVM bit value. 

SUBI 220 ,R7 

Before Instruction: 



R7 = 226h = 550 

LUF LV UF N Z V C = 0 

After Instruction: 

R7 = 14Ah = 330 

LUF LV UF N Z V C = 0 



0 0 0 0 0 0 



0 0 0 0 0 0 
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Subtract Integer, 3-Operand SU BI3 



Syntax 

Operation 

Operands 



Encoding 



SUBI3 $rc2, srrf, dst 
srrf -src2-+ dst 

srrf three-operand addressing modes (T): 
0 0 register (Rn1 , 0 * n1 * 27) 

0 1 indirect (disp = 0, 1 , IRO, IR1) 

1 0 register (Rn1 , 0 s n1 s 27) 

1 1 indirect (disp = 0, 1, IRO , IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, 0 * n2 * 27) 

0 1 register (Rn2, 0 * n2 * 27) 

1 0 indirect (disp = 0, 1, IRO, IR1) 
1 1 indirect (disp = 0,1, IRO, IR1) 

dst register (Rn, 0 * n sc 27) 



31 
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Description 

Cycles 
Status Bits 



Mode Bit 



The difference of the srrf operand minus the src2 operand is loaded into the 
dst register. The srrf, src2 % and dst operands are assumed to be signed inte- 
gers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C 1 if a borrow occurs; 0 otherwise 

OVM Operation is affected by OVM bit value. 
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SUBI3 Subtract Integer, 3-Operand 



Example 1 



Example 2 



SUBI3 R7,R2,R0 

Before Instruction: 

R2 = 0866h = 2150 
R7 = 0834h = 2100 
R0 = 0h 

LUF LV UF N Z V C = 0 0 0 
After Instruction: 



After Instruction: 

AR2 = 80985Eh 

R4 = 0226h = 550 

R3 = 014Ah = 330 

Data at 80985Dh = ODCh = 220 

LUF LV UF N Z V C = 0 0 0 



0 0 0 0 



R2 = 0866h = 2150 
R7 = 0834h = 2100 
R0 = 032h = 50 

LUF LV UF N Z V C = 0 0 0 1 
SUB I 3 *-AR2(l) ,R4,R3 

Before Instruction: 



AR2 = 80985Eh 
R4 = 0226h = 550 
R3 = 0h 

Data at 80985Dh = ODCh = 220 
LUF LV UF N Z V C = 0 0 0 



0 0 0 



0 0 0 0 



0 0 0 0 



Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

■ * 
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Parallel SUBI3 and STI SUBI3||STI 



Syntax SUBI3 srrf, src2, dst1 

|| STI src3, dst2 

Operation src2 -srrf -* dst1 

|| src3-+dst2 

Operands srrf register (Rn1 , 0 * n1 * 7) 

src2 indirect (disp = 0, 1 , IRO, IR1) 

dst1 register (Rn2, 0 s n2 * 7) 

src3 register (Rn3, 0 * n3 a: 7) 

dst2 indirect (disp = 0, 1 , IRO, IR1) 

Encoding 



31 




24 23 




16 


15 
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Description 



Cycles 
Status Bits 



Mode Bit 



An integer subtraction and an integer store are performed in parallel. All regis- 
ters are read at the beginning and loaded at the end of the execute cycle. This 
means that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (SUBI3) writes to the same register, STI 
accepts as input the contents of the register before it is modified by the SUBI3. 

If src3 and dst1 point to the same location, src3 is read before the write to dst 1 . 

1 

These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 1 if a negative result is generated; 0 otherwise 
Z 1 if a 0 result is generated; 0 otherwise 
V 1 if an integer overflow occurs; 0 otherwise 
C 1 if a borrow occurs; 0 otherwise 

OVM Operation is affected by OVM bit value. 
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SUBI3||STI Parallel SUBI3 and STI 



Example subi3 r7,*+ar2(iro) ,ri 

| | STI R3 , *++AR7 

Before Instruction: 



R7 = 14h = 20 

AR2 = 80982Fh 

IR0 = 10h 

R1 =0h 

R3 = 35h = 53 

AR7 = 80983Bh 

Data at 80983Fh = ODCh = 220 

Data at 80983Ch = Oh 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 



R7 = 14h = 20 

AR2 = 80982Fh 

IR0 = 10h 

R1 = 0C8h = 200 

R3 = 35h = 53 

AR7 = 80983Ch 

Data at 80983Fh = ODCh = 220 

Data at 80983Ch = 35h = 53 

LUF LV UF N Z V C = 0 0 0 0 0 00 

i 1 

Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

i i 
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Subtract Reverse Integer With Borrow SU BR B 



Syntax 

Operation 

Operands 



Encoding 



SUBRB src, dst 
src-dst- C dst 

src general addressing modes (G): 

0 0 register (Rn, 0 * n * 27) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0 as n * 27) 



31 
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Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The difference of the src, dst, and C operands is loaded into the dst register. 
The dst and src operands are assumed to be signed integers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C 1 if a borrow occurs; 0 otherwise 

OVM Operation is affected by OVM bit value. 

SUBRB R4,R6 

Before Instruction: 



R4 = 03CBh = 971 
R6 = 0258h = 600 
LUF LV UF N Z V C = 0 

After Instruction: 



0 0 0 0 0 1 



R4 = 03CBh = 971 
R6 = 0172h = 370 
LUF LV UF N Z V C 



= 0000000 
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SU BRF Subtract Reverse Floating Point 



Syntax 

Operation 

Operands 



Encoding 



SUBRF src, dst 
src -dst-* dst 

src general addressing modes (G): 

0 0 register (Rn, 0 * n s: 7) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0 * n <: 7) 



31 
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dst 
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src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The difference of the src operand minus the dst operand is loaded into the dst 
register. The dst and src operands are assumed to be floating-point numbers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF 1 if a floating-point underflow occurs; unchanged otherwise 

LV 1 if a floating-point overflow occurs; unchanged otherwise 

UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if a floating-point overflow occurs; 0 otherwise 

C Unaffected 

OVM Operation is not affected by OVM bit value. 

SUBRF @9905h,R5 

Before Instruction: 



DP = 80h 

R5 = 057B400000h = 6.281 250e + 01 

Data at 809905h = 733C000h = 1 .79750e + 02 

LUF LV UF N Z V C = 0 0 0 0 0 00 



After Instruction: 



DP = 80h 

R5 = 0669E00000h = 1 .16937500e + 02 
Data at 809905h = 733C000h = 1 .79750e + 02 
LUF LV UF N Z V C = 0 0 0 0 0 00 
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Subtract Reverse Integer SU BR1 



Syntax 

Operation 

Operands 



Encoding 



SUBRI src, dst 
src -dst-*' dst 

src general addressing modes (G): 

0 0 register (Rn, 0 * n & 27) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0 * n * 27) 
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Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The difference of the src operand minus the dst operand is loaded into the dst 
register. The dst and src operands are assumed to be signed integers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a 0 result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

C 1 if a borrow occurs; 0 otherwise 

OVM Operation is affected by OVM bit value. 

SUBRI *AR5++(IR0) ,R3 

Before Instruction: 



AR5 = 809900h 
IRO = 8h 

R3 = ODCh = 220 

Data at 809900h = 226h = 550 

LUF LV UFNZVC = 0000000 

After Instruction: 

AR5 = 809908h 
IRO = 8h 

R3 = 014Ah = 330 

Data at 809900h = 226h = 550 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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SWI Software Interrupt 



Syntax 
Operation 
Operands 
Encoding 



SWI 

Performs an emulation interrupt 
None 



31 



i — i — i — i — i — r— 
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Description 

Cycles 
Status Bits 



The SWI instruction performs an emulator interrupt. This is a reserved instruc- 
tion and should not be used in normal programming. 



4 




LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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Trap Conditionally TRAPcond 



Syntax 
Operation 



Operands 
Encoding 



TRAPcond N 

0 - ST(GIE) 

If cond is true: 

Next PC — *++SP, 
Trap vector N -* PC. 

Else: 

Set ST(GIE) to original state. 
Continue. 

N (0 £ N £ 31) 
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Description 



Interrupts are disabled globally when 0 is written to ST(GIE). If the condition 
is true, the contents of the PC are pushed onto the system stack, and the PC 
is loaded with the contents of the specified trap vector (N). If the condition is 
not true, ST(GIE) is set to its value before the TRAPcond instruction changes 
it. 

The TMS320C3x provides 20 condition codes that can be used with this in- 
struction (see Table 10-9 on page 10-13 for a list of condition mnemonics, 
condition codes, and flags). Condition flags are set on a previous instruction 
only when the destination register is one of the extended-precision registers 
(R7-R0) or when one of the compare instructions (CMPF, CMPF3, CMPI, 
CMPI3, TSTB, or TSTB3) is executed. 



Cycles 
Status Bits 



5 




LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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TRAPcond Trap Conditionally 



Example trapz 16 

Before Instruction: 

PC = 123h 
SP = 809870h 
ST = 0h 

Trap Vector 16 = 10h 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

PC = 10h 

SP = 809871 h 

Data at 809871 h = 124h 

ST = 0h 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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Test Bit Fields TSTB 



Syntax 

Operation 

Operands 



Encoding 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



TSTB src, dst 
dst AND src 

src general addressing modes (G): 

0 0 register (Rn, 0 * n * 27) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0 * n s 27) 
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i i i i i i i i i i i i i i i 
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The bitwise logical-AND of the dst and src operands is formed, but the result 
is not loaded in any register. This allows for nondestructive compares. The dst 
and src operands are assumed to be unsigned integers. 

1 

These condition flags are modified for all destination registers (R27-R0). 

LUF Unaffected 

LV Unaffected 

UF 0 

N MSB of the output 

Z 1 if a 0 output is generated; 0 otherwise 

V 0 

C Unaffected 

OVM Operation is not affected by OVM bit value. 

TSTB *-AR4(l),R5 

Before Instruction: 

AR4 = 8099C5h 

R5 = 898h = 2200 

Data at 8099C4h = 767h = 1 895 

LUFLVUFN2VC = 000 0 0 00 

After Instruction: 

AR4 = 8099C5h 

R5 = 898h = 2200 

Data at 8099C4h = 767h = 1895 

LUF LV UF N Z V C = 0 0 0 0 1 0 0 
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TSTB3 Test Bit Fields, 3-Operand 



Syntax 

Operation 

Operands 



Encoding 



TSTB3 src2, srd 
srd AND src2 

srd three-operand addressing modes (T): 
0 0 register (Rn1 , 0 * n1 as 27) 

0 1 indirect (disp = 0, 1 , IRO, IR1) 

1 0 register (Rn1 , 0 * n1 s 27) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, 0 * n2 s 27) 

0 1 register (Rn2, 0 s; n2 s; 1 27) 

1 0 indirect (disp = 0, 1, IRO, IR1) 
1 1 indirect (disp = 0,1, IRO, IR1) 



31 


24 23 




16 


15 




87 




0 


T T 

0 0 1 


1 T T T T 

0 0 1 111 


T 


0 0 0 0 0 


1 1 1 1 1 1 1 

srd 


1 I I I I I I 

src2 



Description 



Cycles 
Status Bits 



The bitwise logical-AND between the srd and src2 operands is formed but is 
not loaded into any register. This allows for nondestructive compares. The 
srd and src2 operands are assumed to be unsigned integers. Although this 
instruction has only two operands, it is designated as a three-operand instruc- 
tion because operands are specified in the three-operand format. 

1 

These condition flags are modified for all destination registers (R27-R0). 

LUF Unaffected 

LV Unaffected 

UF 0 

N MSB of the output 

Z 1 if a 0 output is generated; 0 otherwise 

V 0 

C Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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Test Bit Fields, 3-Operands TSTB3 



Example 1 



TSTB3 



*AR5 — (IR0),*+AR0(1) 



Before Instruction: 

AR5 = 809885h 

IRO = 80h 

ARO = 80992Ch 

Data at 809885h = 898h = 2200 

Data at 80992Dh = 767h = 1 895 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR5 = 809805h 

IRO = 80h 

ARO = 80992Ch 

Data at 809885h = 898h = 2200 

Data at 80992Dh = 767h = 1895 

LUF LV UF N Z V C = 0 0 0 0 1 0 0 



Before Instruction: 

R4 = 0FBC4h 
AR6 = 8099F8h 
IRO = 8h 

Dataat8099F8h = 1568h 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 

R4 = 0FBC4h 
AR6 = 8099F0h 
IRO = 8h 

Dataat8099F8h = 1568h 

LUF LV UF N Z V C = 0 0 0 0 0 00 

i — — — — 1 

Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

i i 



Example 2 



TSTB3 R4 , *AR6 — ( IRO ) 



Assembly Language Instructions 
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XOR Bitwise Exclusive-OR 



Syntax XOR src, dst 

Operation dst XOR src dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 as n * 27) 

0 1 direct 

1 0 indirect 
1 1 immediate 

dst register (Rn, 0 * n * 27) 

Encoding 



31 


24 23 


16 15 


87 


0 


0 0 0 


l l I I I 

110 10 1 


Q 


i i i i 

dst 


i i i i i i i i i i i i i i I 

src 



Description The bitwise exclusive-OR of the src and dst operands is loaded into the dst 

register. The dst and src operands are assumed to be unsigned integers. 

Cycles 1 

Status Bits These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 
LV Unaffected 
UF 0 

N MSB of the output 
Z 1 if a 0 output is generated; 0 otherwise 
V 0 

C Unaffected 

OVM Operation is not affected by OVM bit value. 
XOR R1,R2 

Before Instruction: 



Mode Bit 
Example 



R1 = 0FFA32h 
R2 = 0FF5C1h 

LUF LV UF N Z V C = 0 0 0 0 0 00 
After Instruction: 

R1 = 0FF412h 
R2 = 000FF3h 

LUF LV UF N Z V C = 0 0 0 0 0 00 
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Bitwise Exclusive-PR, 3-Operand X0R3 



Syntax 

Operation 

Operands 



Encoding 



Description 

Cycles 
Status Bits 



XOR3 src2, srd, dst 

srd XOR src2-+ dst 

srd three-operand addressing modes (T): 
0 0 register (Rn1 , 0 as n1 * 27) 

0 1 indirect (disp = 0, 1 , IRO, IR1) 

1 0 register (Rn1 , 0 as n1 * 27) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2 f 0 ss n2 as 27) 

0 1 register (Rn2, 0 as n2 as 27) 

1 0 indirect (disp = 0, 1 , IRO, IR1) 
1 1 indirect (disp = 0,1, IRO, IR1) 

dst register (Rn, 0 * n as 27) 



31 


24 23 


16 15 




87 




0 


— i — r 
0 0 1 


I I I I I 

0 1 0 0 0 0 


1 

T 


I I I I 

dst 


i i i i i i i 

srd 


i i i i i i i 

src2 



The bitwise exclusive-OR between the srd and src2 operands is loaded into 
the dst register. The srd, src2, and dst operands are assumed to be unsigned 
integers. 

1 

These condition flags are modified only if the destination register is R7-R0. 

LUF Unaffected 

LV Unaffected 

UF 0 

N MSB of the output 

Z 1 if a 0 output is generated; 0 otherwise 

V 0 

C Unaffected 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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X0R3 Bitwise Excluslve-OR, 3-Operand 



Example 1 



Example 2 



X0R3 *AR3++(IR0) ,R7,R4 

Before Instruction: 

AR3 = 809800h 
IR0 = 10h 
R7 = OFFFFh 
R4 = 0h 

Data at 809800h = 5AC3h 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR3 = 80981 Oh 

IR0 = 10h 

R7 = OFFFFh 

R4 = 0A53Ch 

Data at 809800h = 5AC3h 

LUF LV UF N Z V C = 0 0 0 0 0 00 
XOR3 R5,*-AR1(1) ,Rl 

Before Instruction: 

R5 = 0FFA32h 
AR1 = 809826h 
R1 =0h 

Data at 809825h = 0FF5C1 h 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

R5 = 0FFA32h 
AR1 = 809826h 
R1 = 000F33h 

Data at 809825h = 0FF5C1 h 

LUF LV UF N Z V C = 0 0 0 0 0 00 



Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

■ ■ 
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Parallel X0R3 and STI X0R3HSTI 



Syntax X0R3 src2, srtf, d$t1 

|| STI src3, dst2 

Operation srd XOR src2 -* dst 1 

|| src3^dst2 

Operands srd register (Rn1 , 0 * n1 * 7) 

src2 indirect (disp = 0, 1 , IRO, IR1) 

dst1 register (Rn2 t 0 * n2 * 7) 

src3 register (Rn3, 0 * n3 * 7) 

ds*2 indirect (disp = 0, 1 , IRO, IR1) 

Encoding 



31 




24 23 




16 15 




87 




0 


f 

1 1 


1 T 1 T 

10 111 


i i 

dst 


1 1 

srd 


i i 

src3 


dst2 




src2 





Description 



Cycles 
Status Bits 



Mode Bit 



A bitwise exclusive-XOR and an integer store are performed in parallel. All reg- 
isters are read at the beginning and loaded at the end of the execute cycle. This 
means that, if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (XOR3) writes to the same register, STI 
accepts as input the contents of the register before it is modified by the XOR3. 

If src2 and dsf2 point to the same location, src2\s read before the write to dst2. 

1 

These condition flags are modified only if the destination register is R7-R0. 
LUF Unaffected 
LV Unaffected 
UF 0 

N MSB of the output 

Z 1 if a 0 output is generated; 0 otherwise 

V 0 

C Unaffected 

OVM Operation is not affected by OVM bit value. 
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X0R3| 1STI Parallel X0R3 and STI 



Example xor3 *ari++,r3,r3 

| | STI R6 , *-AR2 ( IRO ) 

Before Instruction: 

AR1 = 80987Eh 
R3 = 85h 
R6 = ODCh = 220 
AR2 = 8098B4h 
IRO = 8h 

Data at 80987Eh . 85h 
Data at 8098ACh = Oh 

LUF LV UF N Z V C = 0 0 0 0 0 00 

After Instruction: 

AR1 = 80887Fh 
R3 = 0h 

R6 = ODCh = 220 
AR2 = 8098B4h 
IRO = 8h 

Data at 80987Eh = 85h 

Data at 8098ACh = ODCh = 220 

LUF LV UF N Z V C = 0 0 0 0 0 00 



Note: Cycle Count 

See subsection 9.5.2 on page 9-24 for operand ordering effects on cycle 
count. 

■ ■ 
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Chapter 11 



Software Applications 



The TMS320C3x is a powerful digital signal processor with an architecture and 
instruction set designed to find simple solutions to DSP problems. There are 
instructions specifically designed for efficient implementation of DSP algo- 
rithms as well as general-purpose instructions that make the device suitable 
for more general tasks, like any microprocessor. The floating-point and integer 
arithmetic supported by the device let you concentrate on the algorithm and 
pay less attention to scaling, dynamic range, and overflows. 

The purpose of this chapter is to explain how to use the instruction set, the ar- 
chitecture, and the interface of the TMS320C3x processor. It presents coding 
examples for frequently used applications and discusses more involved exam- 
ples and applications. This chapter defines the principles involved in the ap- 
plications and provides the corresponding assembly-language code for in- 
structional purposes and for immediate use. Whenever the detailed explana- 
tion of the underlying theory is too extensive to be included in this manual, ap- 
propriate references are given for further information. 

Major topics discussed in this chapter are listed below. 



Topic Page 

11.1 Processor Initialization ............ ............. 11-2 

11.2 Program Control 11-6 

11.3 Logical and Arithmetic Operations 11-23 

11 .4 Application-Oriented Operations 11-53 

11.5 Programming Tips 11-131 
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Processor Initialization 



11.1 Processor Initialization 

Before you execute a digital signal processing algorithm, you must initialize 
the processor. Initialization usually occurs any time the processor is reset. 

You can reset the processor by applying a low level to the RESET input for sev- 
eral cycles. At this time, the TMS320C3x terminates execution and puts the 
reset vector (that is, the contents of memory location 0) in the program counter. 
The reset vector normally contains the address of the system-initialization rou- 
tine. The hardware reset also initializes various registers and status bits. 

After reset, you can further initialize the processor by executing instructions 
that set up operational modes, memory pointers, interrupts, and the remaining 
functions needed to meet system requirements. 

To configure the processor at reset, you should initialize the following internal 
functions: 

□ Memory-mapped registers 

□ Interrupt structure 

In addition to the initialization performed during the hardware reset (for condi- 
tions after hardware reset, see Chapter 12), Example 11-1 shows coding for 
initializing the TMS320C3x to the following machine state: 

□ All interrupts are enabled. 

□ The overflow mode is disabled. 

□ The data memory page pointer is set to 0. 

□ The internal memory is filled with 0s. 

Note that all constants larger than 1 6 bits should be placed in memory and ac- 
cessed through direct or indirect addressing. 
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Processor Initialization 



Example 11-1. TMS320C3x Processor Initialization 



TITLE PROCESSOR INITIALIZATION 



. global 
• global 

• global 

• global 
. global 
•global 
. global 

PROCESSOR 



RESET, INI T, BEGIN 
INTO , INT1 , INT2 , INT3 
ISR0,ISR1,ISR2,ISR3 
DINT, DMA 



DINT, DMA 

TINTO , TINT1 , XINTO , RINTO , XINTl , RINTl 
TIMEO , TIME1 , XMTO , RCVO , XMT1 , RCV1 
TRAPO , TRAP1 , TRAP 2 , TRPO , TRP1 , TRP2 

INITIALIZATION FOR THE TMS320C3x 



* RESET AND INTERRUPT VECTOR SPECIFICATION. THIS 

* ARRANGEMENT ASSUMES THAT DURING LINKING, THE FOLLOWING 

* TEXT SEGMENT WILL BE PLACED TO START AT MEMORY 

* LOCATION 0. 



.sect "init" 
RESET .word INIT 
INTO .word ISRO 
INT1 .word ISR1 
INT2 .word ISR2 
INT3 .word ISR3 

* XINTO .word XMTO 

* RINTO .word RCVO 

* XINTl .word XMT1 

* RINTl .word RCV1 
TINTO .word TIMEO 
TINT1 .word TIME1 
DINT .word DMA 

•space 20 
TRAPO .word TRPO 
TRAP1 .word TRP1 
TRAP 2 .word TRP2 

•space 29 



Named section 

RS- load address INIT to PC 
INTO- loads address ISRO to PC 
INT1- loads address ISRl to PC 
INT2- loads address ISR2 to PC 
INT3- loads address ISR3 to PC 

; Serial port 0 transmit interrupt processing 
; Serial port 0 receive interrupt processing 
; Serial port 1 transmit interrupt processing 
; Serial port 1 receive interrupt processing 
Timer 0 interrupt processing 
Timer 1 interrupt processing 
DMA interrupt processing 
Reserved space 

Trap 0 vector processing begins 
Trap 1 vector processing begins 
Trap 2 vector processing begins 
Leave space for the other 29 traps 



* IN THE FOLLOWING SECTION, CONSTANTS THAT CANNOT BE REPRESENTED 

* IN THE SHORT FORMAT ARE INITIALIZED. THE NUMBERS IN PARENTHESIS 

* AT THE END OF THE COMMENTS REPRESENT THE OFFSET OF A 

* PARTICULAR CONTROL REGISTER FROM 

* CTRL (808000H) 
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Processor Initialization 



MASK 

BLKO 

BLK1 

STCK 

CTRL 

DMACTL 

TIMOCTL 

TIM1CTL 

SERGLOBO 

SERPRTXO 

SERPRTRO 

SERTIMO 

SERGLOBl 

SERPRTX1 

SERPRTR1 

SERTIM1 

PARINT 

IOINT 

* 



.data 
•word 
•word 
•word 
•word 
•word 
•word 
•word 
•word 

• word 
•word 
•word 
•word 
•word 

• word 
•word 
•word 
•word 

• word 

• text 



OFFFFFFFFH 

0809800H 

0809C00H 

0809F00H 

0808000H 

0000000H 

0000000H 

0000000H 

0000000H 

0000000H 

0000000H 

0000000H 

0000000H 

0000000H 

0000000H 

0000000H 

0000000H 

0000000H 



Beginning address of RAM block 0 
Beginning address of RAM block 1 
Beginning of stack 

Pointer for peripheral-bus memory map 
Init for DMA control (0) 
Init of timer 0 control (32) 
Init of timer 1 control (48) 

0 glbl control (64) 
0 xmt port control (66) 
0 rev port control (67) 

0 timer control (68) 

1 glbl control (80) 
1 xmt port control (82) 
1 rev port control (83) 
1 timer control (84) 

Init of parallel interface control (100) 
Init of I/O interface control (96) 



Init of serial 
Init of serial 
Init of serial 
Init of serial 
Init of serial 
Init of 
Init of 



serial 
serial 



Init of serial 



* THE ADDRESS AT MEMORY LOCATION 0 DIRECTS EXECUTION TO BEGIN HERE 

* FOR RESET PROCESSING THAT INITIALIZES THE PROCESSOR. WHEN RESET 

* IS APPLIED, THE FOLLOWING REGISTERS ARE INITIALIZED TO 0: 

* 



* ST — CPU STATUS REGISTER 

* IE CPU/DMA INTERRUPT ENABLE FLAGS 

* IF — CPU INTERRUPT FLAGS 

* IOF — I/O FLAGS 

* THE STATUS REGISTER HAS THE FOLLOWING ARRANGEMENT: 



* BITS: 31-14 13 12 11 10 9 8 7 6 5 4 3 2 10 

* FUNCTION: RESRV GIE CC CE CF RESRV RM OVM LUF LV UF N Z V C 



INIT LDP 0,DP ; Point the DP register to page 0 

LDI 1800H,ST ; Clear and enable cache, and disable OVM 
LDI § MASK, IE ; Unmask all interrupts 

* 

INTERNAL DATA MEMORY INITIALIZATION TO FLOATING POINT 0 

* 



II 



LDI §BLK0,AR0 
LDI @BLK1 , ARl 
LDF0.0,R0 
RPTS 1023 
STF R0,*AR0++(1) 
STF R0,*AR1++(1) 



AR0 points to block 0 

ARl points to block 1 

0 register R0 

Repeat 1024 times • • . 

Zero out location in RAM block 0 and 

Zero out location in RAM block 1 



11.-4 



Processor Initialization 



THE PROCESSOR IS INITIALIZED. THE REMAINING APPLICATION- 
DEPENDENT PART OF THE SYSTEM (BOTH ON- AND OFF-CHIP) SHOULD 
NOW BE INITIALIZED. 

FIRST, INITIALIZE THE CONTROL REGISTERS. IN THIS EXAMPLE, 
EVERYTHING IS INITIALIZED TO 0, SINCE THE ACTUAL INITIALIZATION IS 
APPLICATION-DEPENDENT . 



LDI @CTRL,AR0 

LDI @DMACTL,R0 
STI RO, *+AR0(0) 



Load in ARO the pointer to control 
registers 

Init DMA control 



LDI §TIM0CTL,R0 
STI RO, *+AR0(32) 
LDI §TIM1CTL,R0 
STI R0,*+AR0(48) 
LDI @ SERGLOBO , RO 
STI R0,*+AR0(64) 
LDI @SERPRTX0,R0 
STI R0,*+AR0(66) 
LDI 6SERPRTR0,R0 
STI R0,*+AR0(67) 
LDI @SERTIM0,R0 
STIR0,*+AR0(68) 
LDI @SERGLOB1,RO 
STI R0,*+AR0(80) 
LDI §SERPRTX1,R0 
STI R0,*+AR0(82) 
LDI §SERPRTR1,R0 
STI R0,*+AR0(83) 
LDI @SERTIM1,R0 
STI R0,*+AR0(84) 
LDI @PARINT,R0 
STI R0,*+AR0(100) 
LDI §IOINT,R0 
STI R0,*+AR0(96) 

LDI§STCK,SP 
OR 2000H,ST 

BR BEGIN 



Init timer 0 control 

Init timer 1 control 

Init serial 0 global control 

Init serial 0 xmt control 

Init serial 0 rev control 

Init serial 0 timer control 

Init serial 1 global control 

Init serial 1 xmt control 

Init serial 1 rev control 

Init serial 1 timer control 

Init parallel interface control (C30 only) 

Init I/O interface control 

Init the stack pointer 
Global interrupt enable 

Branch to the beginning of application 



• end 
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11.2 Program Control 

One group of TMS320C3x instructions provides program control and facili- 
tates all types of high-speed processing. These instructions directly handle: 

□ subroutine calls 

□ software stack 

□ interrupts 

□ zero-overhead branches 

□ single- and multiple-instruction loops without any overhead 

11.2.1 Subroutines 

The TMS320C3x has a 24-bit program counter (PC) and a practically unlimited 
software stack. The CALL and CALLcond subroutine calls cause the stack 
pointer to increment and store the contents of the next value of the PC counter 
on the stack. At the end of the subroutine, RETScond performs a conditional 
return. 

Example 11-2 illustrates the use of a subroutine to determine the dot product 
between two vectors. Given two vectors of length N, represented by the arrays 

a [0], a [1] a [N -1] and b [0], b [1] b [N -1], the dot product is computed 

from the expression 

d = a [0] b [0] + a [1] b [1] + ... + a [N -1] b [N -1] 

Processing proceeds in the main routine to the point where the dot product is 
to be computed. It is assumed that the arguments of the subroutine have been 
appropriately initialized. At this point, a CALL is made to the subroutine, 
transferring control to that section of the program memory for execution, then 
returning to the calling routine via the RETS instruction when execution has 
completed. Note that for this particular example, it would suffice to save the 
register R2. However, a larger number of registers are saved for demonstra- 
tion purposes. The saved registers are stored on the system stack. This stack 
should be large enough to accommodate the maximum anticipated storage re- 
quirements. You could use other methods of saving registers equally well. 
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Program Control 



Example 11-2. Subroutine Call (Dot Product) 
* 

* TITLE SUBROUTINE CALL (DOT PRODUCT) 
* 

* MAIN ROUTINE THAT CALLS THE SUBROUTINE 'DOT' TO COMPUTE THE 

* DOT PRODUCT OF TWO VECTORS 



LDI 
LDI 
LDI 



@blkO,ARO 
@blkl , AR1 
N,RC 



ARO points to vector a 
AR1 points to vector b 
RC contains the number of elements 



CALL DOT 



SUBROUTINE 



DOT 



* EQUATION: d « a(0) * b(0) + a(l) * b(l) + ... + a(N-l) * b(N-l) 
* 

* THE DOT PRODUCT OF a AND b IS PLACED IN REGISTER RO. N MUST 

* BE GREATER THAN OR EQUAL TO 2. 



* ARGUMENT ASSIGNMENTS: 

* ARGUMENT | FUNCTION 



* ARO 

* AR1 

* RC 
★ 



ADDRESS OF a(0) 
ADDRESS OF b(0) 
LENGTH OF VECTORS (N) 



* REGISTERS USED AS INPUT: ARO, ARl, RC 

* REGISTER MODIFIED: RO 

* REGISTER CONTAINING RESULT: RO 
* 



* 

DOT 



. global DOT 



ST 
R2 



PUSH 
PUSH 
PUSHF R2 
PUSH ARO 
PUSH 
PUSH 



ARl 
RC 



Save status register 

Use the stack to save R2's 

Lower 32 and upper 32 bits 

Save ARO 

Save ARl 

Save RC 



Software Applications 
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MPYF3 *AR0,*AR1,R0 

LDF0.0,R2 

SUBI 2,RC 

DOT PRODUCT (1 <« i < N) 

RPTS RC 



Initialize ROs 
a(0) * b(0) -> RO 
Initialize R2 
Set RC - N-2 



Setup the repeat single 



MPYF3 *++AR0(l),*++ARl(l),R0 ; a(i) * b(i) -> RO 



ADDF3 R0,R2,R2 
ADDF3 R0,R2,R0 



a(i-l)*b(i-l) + R2 -> R2 
a(N-l)*b(N-l) + R2 -> RO 



RETURN SEQUENCE 



POP 
POP 
POPF 
POP R2 
POP ST 
RETS 



RC 
ARl 
ARO 
R2 



Restore RC 
Restore ARl 
Restore ARO 

Restore top 32 bits of R2 
Restore bottom 32 bits of R2 
Restore ST 
Return 



end 
• end 



11.2.2 Software Stack 

The TMS320C3x has a software stack whose location is determined by the 
contents of the stack pointer register (SP). The stack pointer increments from 
low to high values, and provisions should be made to accommodate the antici- 
pated storage requirements. The stack can be used not only during the sub- 
routines CALL and RETS, but also inside the subroutine as a place of tempo- 
rary storage of the registers, as shown in Example 11-2. SP always points to 
the last value pushed on the stack. 
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The CALL and CALLcond instructions and the interrupt routines push the 
value of the PC onto the stack. RETScond and RETIcondthen pop the stack 
and place the value in the program counter. You can also use the PUSH and 
POP instructions to maneuver the integer value of any register onto and off the 
stack, respectively. There are two additional instructions, PUSHF and POPF, 
for floating point numbers. You can push and pop floating point numbers to reg- 
isters R7-R0. This feature makes it easy to save all 40 bits of the extended 
precision registers (see Example 11-2). Using PUSH and PUSHF on the 
same register saves the lower 32 and upper 32 bits. PUSH saves the lower 
32; PUSHF, the upper 32. POPF, followed by POP, will recover this extended 
precision number. It is important to perform the integer and floating-point 
PUSH and POP in the order given above. POPF forces the least significant 
eight bits of the extended-precision registers to 0 and therefore must be per- 
formed first. 

You can easily read and write to the SP to create multiple stacks for different 
program segments. SP is not initialized by the hardware during reset. It is 
therefore important to remember to initialize its value so that SP points to a pre- 
determined memory location. This avoids the problem of SP attempting to 
write into ROM or otherwise write over useful data. 

11.2.3 Interrupt Service Routines 

Interrupts on the TMS320C3x are prioritized and vectored. When an interrupt 
occurs, the corresponding flag is set in the interrupt flag register IF. If the corre- 
sponding bit in the interrupt enable register (IE) is set, and interrupts are en- 
abled by having the GIE bit in the status register set to 1 , interrupt processing 
begins. You can also write to the interrupt flag register, allowing you to force 
an interrupt by software or to clear interrupts without processing them. 

Even when the interrupt is disabled, you can read the interrupt flag register (IF) 
and take appropriate action, depending on whether the interrupt has occurred. 
This is true even when the interrupt is disabled. This can be useful when an 
interrupt-driven interface is not implemented. Example 11-3 shows the case 
in which a subroutine is called when interrupt 1 has not occurred. 

Example 11-3. Use of Interrupts for Software Polling 

* TITLE INTERRUPT POLLING 

TSTB 2, IF ; Test if interrupt 1 has occurred 

CALLZ SUBROUTINE ; If not, call subroutine 



Software Applications 1 1 -9 



When interrupt processing begins, the PC is pushed onto the stack, and the 
interrupt vector is loaded in the PC. Interrupts are then disabled by setting the 
GIE = 0, and the program continues from the address loaded in the PC. Since 
all interrupts are disabled, interrupt processing can proceed without further in- 
terruption, unless the interrupt service routine re-enables interrupts. 

Except for very simple interrupt service routines, it is important to ensure that 
the processor context is saved during execution of this routine. You must save 
the context before you execute the routine itself and restore it after the routine 
is finished. The procedure is called context switching. Context switching is also 
useful for subroutine calls, especially during extensive use of the auxiliary and 
the extended precision registers. This section contains code examples of con- 
text switching and an interrupt service routine. 
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11.2.3.1 Context Switching 

Context switching is commonly required during the processing of subroutine 
calls or interrupts. It might be quite extensive or it might be simple, depending 
on system requirements. On the TMS320C3x, the program counter is auto- 
matically pushed onto the stack. Important information in other TMS320C3x 
registers, such as the status, auxiliary, or extended-precision registers, must 
be saved by special commands. In order to preserve the state of the status reg- 
ister, you should push it first and pop it last. This keeps the restoration of the 
extended precision registers from affecting the status register. 

Example 11-4 and Example 11-5 show saving and restoring of the 
TMS320C3x state. In both examples, the stack is used for saving the registers, 
and it expands towards higher addresses. If you don't want to use the stack 
pointed at by SP, you can create a separate stack by using an auxiliary register 
as the stack pointer. Registers saved in these examples are: 

□ Extended-precision registers R7 through RO 

□ Auxiliary registers AR7 through ARO 

□ Data-page pointer DP 

□ Index registers IRO and IR1 

□ Block-size register BK 

□ Status register ST 

□ Interrupt-related registers IE and IF 

□ I/O flag IOF 

□ Repeat-related registers RS, RE, and RC 



Software Applications 11-11 



Program Control 



Example 11-4. Context Save for the TMS320C3x 

* TITLE CONTEXT SAVE FOR THE TMS320C3x 
★ 

. global SAVE 

* 

* CONTEXT SAVE ON SUBROUTINE CALL OR INTERRUPT 
* 

SAVE: 

PUSH ST ; Save status register 

* SAVE THE EXTENDED PRECISION REGISTERS 
* 



PUSH 


RO 


} Save the lower 32 


bits 






PUSHF 


RO 


; and the upper 


32 bits 


of 


RO 


PUSH 


Rl 


) Save the lower 32 


bits 






PUSHF 


Rl 


} and the upper 


32 bits 


of 


Rl 


PUSH 


R2 


? Save the lower 32 


bits 






PUSHF 


R2 


and the upper 


32 bits 


of 


R2 


PUSH 


R3 


) Save the lower 32 


bits 






PUSHF 


R3 


? and the upper 


32 bits 


of 


R3 


PUSH 


R4 , 


Save the lower 32 


bits 






PUSHF 


R4 


and the upper 


32 bits 


of 


R4 


PUSH 


R5 


Save the lower 32 


bits 






PUSHF 


R5 


and the upper 


32 bits 


of 


R5 


PUSH 


R6 


Save the lower 32 


bits 






PUSHF 


R6 


and the upper 


32 bits 


of 


R6 


PUSH 


R7 , 


Save the lower 32 


bits 






PUSHF 


R7 


and the upper 


32 bits 


of 


R7 



* SAVE THE AUXILIARY REGISTERS 
* 



PUSH 


ARO 


; Save 


ARO 


PUSH 


AR1 


; Save 


AR1 


PUSH 


AR2 


; Save 


AR2 


PUSH 


AR3 


; Save 


AR3 


PUSH 


AR4 


} Save 


AR4 


PUSH 


AR5 


} Save 


AR5 


PUSH 


AR6 


j Save 


AR6 


PUSH 


AR7 


; Save 


AR7 
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* SAVE THE REST REGISTERS FROM THE REGISTER FILE 
* 



PUSH 


DP , 


Save data page pointer 


nrio U 

ruon 


XKU 


Save index register IRO 


PUSH 


IR1 


Save index register IR1 


PUSH 


BK 


; Save block-size register 


PUSH 


IE , 


; Save interrupt enable register 


PUSH 


IF , 


* f Save interrupt flag register 


PUSH 


IOF , 


; Save I/O flag register 


PUSH 


RS t 


; Save repeat start address 


PUSH 


RE 


; Save repeat end address 


PUSH 

* 


RC , 


; Save repeat counter 


* SAVE 
* 


IS COMPLETE 
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Example 11-5. Context Restore for the TMS320C3x 

* 

* TITLE CONTEXT RESTORE FOR THE TMS320C3X 
* 

.global RESTR 



* CONTEXT RESTORE AT THE END OF A SUBROUTINE CALL OR INTERRUPT 



RESTR: 



* RESTORE THE REST REGISTERS FROM THE REGISTER FILE 



POP RC 


f Restore repeat counter 


POP RE 


; Restore repeat end address 


T>/"\1> r\r* 


; Restore repeat start address 


POP IOF 


f Restore I/O flag register 


POP IF 


) Restore interrupt flag register 


POP IE 


f Restore interrupt enable register 


POP BK 


Restore block-size register 


POP IR1 


; Restore index register IRl 


POP IRO 


Restore index register IRO 


POP DP 


; Restore data page pointer 



* RESTORE THE AUXILIARY REGISTERS 

* 



POP AR7 , 


Restore 


AR7 


POP AR6 


Restore 


AR6 


POP AR5 , 


Restore 


AR5 


POPAR4 


Restore 


AR4 


POP AR3 , 


Restore 


AR3 


POP AR2 , 


Restore 


AR2 


POPAR1 , 


Restore 


AR1 


POPARO t 


Restore 


ARO 



* RESTORE THE EXTENDED PRECISION REGISTERS 
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POPF 


R7 


POP R7 




POPF 


R6 


POP R6 




POPF 


R5 


POP R5 




POPF 


R4 


POP R4 




POPF 


R3 


POP R3 








POP R2 




POPF 


Rl 


POPR1 




POPF 


RO 


POPRO 




POP ST 





Restore the upper 32 bits and 

the lower 32 bits of R7 
Restore the upper 32 bits and 

the lower 32 bits of R6 
Restore the upper 32 bits and 

the lower 32 bits of R5 
Restore the upper 32 bits and 

the lower 32 bits of R4 
Restore the upper 32 bits and 

the lower 32 bits of R3 
Restore the upper 32 bits and 

the lower 32 bits of R2 
Restore the upper 32 bits and 

the lower 32 bits of Rl 
Restore the upper 32 bits and 

the lower 32 bits of RO 
Restore status register 



RESTORE IS COMPLETE 
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11.2.3.2 Interrupt Priority 

Interrupts on the TMS320C3x are automatically prioritized. This allows inter- 
rupts that occur simultaneously to be serviced in a predefined order. Infrequent 
but lengthy interrupt service routines might need to be interrupted by more fre- 
quently occurring interrupts. In Example 11-6, the interrupt service routine for 
INT2 temporarily modifies the IE to permit interrupt processing when an inter- 
rupt to INTO (but no other interrupt) occurs. When the routine has finished pro- 
cessing, the IE register is restored to its original state. Notice that the 
RETIconofinstruction not only pops the next program counter address from the 
stack, but also sets the GIE bit of the status register. This enables all interrupts 
that have their interrupt enable bit set. 



Example 11-6. Interrupt Service Routine 

* TITLE INTERRUPT SERVICE ROUTINE 

* .global ISR2 
ENABLE .set 2000h 
MASK . set 1 

* 

* INTERRUPT PROCESSING FOR EXTERNAL INTERRUPT INT2- 



ISR2: 



PUSH ST 
PUSH DP 
PUSH IE 
PUSH RO 
PUSHF RO 
PUSH Rl 
PUSHF Rl 
LDI MASK, IE 
OR ENABLE, ST 



Save status register 

Save data page pointer 

Save interrupt enable register 

Save lower 32 bits and 

upper 32 bits of RO 
Save lower 32 bits and 

upper 32 bits of Rl 
Unmask only INTO 
Enable all interrupts 



MAIN PROCESSING SECTION FOR ISR2 



XOR ENABLE, ST 

POPF Rl 

POPR1 

POPF RO 

POPRO 

POP IE 

POP DP 

POP ST 



Disable all interrupts 
Restore upper 32 bits and 

lower 32 bits of Rl 
Restore upper 32 bits and 

lower 32 bits of RO 
Restore interrupt enable register 
Restore data page register 
Restore status register 



RETI 



Return and enable interrupts 
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11.2.4 Delayed Branches 



The TMS320C3x uses delayed branches to create single-cycle branching. 
The delayed branches operate like regular branches but do not flush the pipe- 
line. Instead, the three instructions following a delayed branch are also ex- 
ecuted. As discussed in Chapter 6, the only limitations are that none of the 
three instructions following a delayed branch can be a: 

□ Branch (standard or delayed) 

□ Call to a subroutine 

□ Return from a subroutine 

□ Return from an interrupt 

□ Repeat instruction 

□ TRAP instruction 

□ IDLE instruction 

Conditional delayed branches use the conditions that exist at the end of the 
instruction immediately preceding the delayed branch. Sometimes a branch 
is necessary in the flow of a program, but fewer than three instructions can be 
placed after a delayed branch. For faster execution, it is still advantageous to 
use a delayed branch. This is shown in Example 11-7, with NOPs taking the 
place of the unused instructions. The trade-off is more instruction words for 
less execution time. 



Example 11-7. Delayed Branch Execution 



TITLE DELAYED BRANCH EXECUTION 



LDF 

BGED 

LDFN 

SUBF 

NOP 



*+ARl(5) ,R2 
SKIP 
R2 ,R1 
3.0,R1 



Load contents of memory to R2 
If loaded number >=0, branch (delayed) 
If loaded number <0 , load it to Rl 
Subtract 3 from Rl 

Dummy operation to complete delayed 



MPYF 



1.5,R1 



branch 

Continue here if loaded number <0 



SKIP LDFR1,R3 



Continue here if loaded number >=0 
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11.2.5 Repeat Modes 

The TMS320C3x supports looping without any overhead. For that purpose, 
there are two instructions: RPTB repeats a block of code, and RPTS repeats 
a single instruction. There are three control registers: repeat start address 
(RS), (repeat end address (RE), and repeat counter (RC). These contain the 
parameters that specify loop execution (refer to Section 6.1 on page 6-2 for 
a complete description of RPTB and RPTS). RS and RE are automatically set 
from the code, while you must set RC, as shown in the examples below. 

11.2.5.1 Block Repeat 

Example 11-6 shows an application of the block repeat construct. In this ex- 
ample, an array of 64 elements is flipped over by exchanging the elements that 
are equidistant from the end of the array. In other words, if the original array is 

a(1),a(2) a(31),a(32) a(64); 

the final array after the rearrangement will be 

a(64),a(63) a(32),a(31) a(1). 

Because the exchange operation is done on two elements at the same time, 
it requires 32 operations. The repeat counter RC is initialized to 31 . In general, 
if RC contains the number N, the loop will be executed N + 1 times. The loop 
is defined by the RPTB instruction and the EXCH label. 
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Example 11-8. Loop Using Block Repeat 

* TITLE LOOP USING BLOCK REPEAT 

* THIS CODE SEGMENT EXCHANGES THE VALUES OF ARRAY ELEMENTS THAT ARE 

* SYMMETRIC AROUND THE MIDDLE OF THE ARRAY. 





LDI @ADDR, ARO ; ARO points to the beginning of the array 




LDI ARO 


,AR1 






ADD I 


63,AR1 , 


AR1 points to the end of the 


* 






64 -element array 




LDI 


31, RC 


Initialize repeat counter 




RPTB 


EXCH , 


Repeat RC+1 times between here and 


* 






EXCH 




LDI 


*AR0,R0 


Load one memory element in RO, 


II 


LDI 


*ARl ,Rl , 


and the other in Rl 


EXCH 


STIR1, 


*AR0++(1) , 


Then, exchange their locations 


II 


STIRO, 


*AR1 — (1) 





Subsection 6.1.2 on page 6-3 specifies restrictions in the block-repeat con- 
struct. Because the program counter is modif ied at the end of the loop accord- 
ing to the contents of the registers RS, RE, and RC, no operation should at- 
tempt to modify the repeat counter or the program counter at the end of the 
loop in a different way. 

In principle, it is possible to nest repeat blocks. However, there is only one set 
of control registers: RS, RE, and RC. It is therefore necessary to save these 
registers before entering an inside loop. It might be more practical to imple- 
ment a nested loop by the more traditional method of using a register as a 
counter and then using a delayed branch rather than using the nested repeat 
block approach. 

Example 1 1 -9 shows another example of using the block repeat to find a maxi- 
mum of 1 47 numbers. 
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Example 1 1-9. Use of Block Repeat to Find a Maximum 



* 

* TITLE USE OF BLOCK REPEAT TO FIND A MAXIMUM 
* 

* THIS ROUTINE FINDS THE MAXIMUM OF N * 147 NUMBERS. 



LDI 
LDI 
LD 



146, RC 
@ADDR, ARO 
*AR0++(1) ,R0 



Initialize repeat counter to 147-1 
ARO points to beginning of array 
Initialize MAX to the first value 



RPTB LOOP 
CMPF *AR0++(1),R0 
LOOP LDFLT *- ARO ( 1 ) , RO 



Compare number to the maximum 

If greater, this is a new maximum 



11.2.5.2 Single-Instruction Repeat 

The single-instruction repeat uses the control registers RS, RE, and RC in the 
same way as the block repeat. The advantage over the block repeat is that the 
instruction is fetched only once, and then the buses are available for moving 
operands. Note that the single-instruction repeat construct is not interruptible, 
while block repeat is interruptible. 

Example 11-10 shows an application of the single-repeat construct. In this ex- 
ample, the sum of the products of two arrays is computed. The arrays are not 
necessarily different. If the arrays are a(i) and b(i), each of length N = 512, 
register RO will contain, after computation, this quantity: 

a (1) b (1) + a (2) b (2) +...+ a (N) b (N). 

The value of the RC is specified to be 51 1 in the instruction. If RC contains the 
number N, the loop will be executed N + 1 times. 
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Example 11-10. Loop Using Single Repeat 



* TITLE LOOP USING SINGLE REPEAT 

* THIS CODE SEGMENT COMPUTES SUM[ a( i )b( i) ] FOR i - 1 to N. 

* 



LDI 
LDI 



§ADDRl , ARO 
@ADDR2,AR1 



LDF 0.0, RO 

MPYF3 *AR0++ ( 1 ) , *AR1++ ( 1 ) , Rl 
RPTS 511 



ARO points to array a(i) 
AR1 points to array b(i) 

Initialize RO 



Compute first product 
Repeat 512 times 



MPYF3 *AR0++ ( 1 ) , *ARl++ ( 1 ) , Rl , RO 
ADDF3 R1,R0,R0 



Compute next product 
and accumulate the 
previous one 



ADDF Rl f RO 



One final addition 
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11.2.6 Computed GOTOs 

It is occasionally convenient to select during run time (and not during assem- 
bly) the subroutine to be executed. The TMS320C3x's computed GOTO sup- 
ports this selection. The computed GOTO is implemented using the CALLcond 
instruction in the register-addressing mode. This instruction uses the contents 
of the register as the address of the call. Example 11-11 shows a computed 
GOTO for a task controller. 



Example 11-11. Computed GOTO 



TITLE COMPUTED GOTO 
TASK CONTROLLER 



THIS MAIN ROUTINE CONTROLS THE ORDER OF TASK EXECUTION (6 TASKS 
IN THE PRESENT EXAMPLE ) . TASKO THROUGH TASK 5 ARE THE NAMES OF 
SUBROUTINES TO BE CALLED. THEY ARE EXECUTED IN ORDER, TASKO , 
TASKl , . . . TASK5 . WHEN AN INTERRUPT OCCURS, THE INTERRUPT 
SERVICE ROUTINE IS EXECUTED, AND THE PROCESSOR CONTINUES 
WITH THE INSTRUCTION FOLLOWING THE IDLE INSTRUCTION. THIS 
ROUTINE SELECTS THE TASK APPROPRIATE FOR THE CURRENT CYCLE, 
CALLS THE TASK AS A SUBROUTINE, AND BRANCHES BACK TO THE IDLE 
TO WAIT FOR THE NEXT SAMPLE INTERRUPT WHEN THE SCHEDULED TASK 
HAS COMPLETED EXECUTION. RO HOLDS THE OFFSET FROM THE BASE 
ADDRESS OF THE TASK TO BE EXECUTED. 





LDI 


5,R0 


; Initialize RO 




LDI 


@ADDR, AR1 


; AR1 holds base address of the table 


WAIT 


IDLE 




* t Wait for the next interrupt 




ADD 1 3 


*AR1,R0,AR2 


; Add the base address to the table 


* 






f Entry number 




SUBI 


1,R0 


Decrement RO 




LDILT 


5,R0 


f If R0<0, reinitialize it to 5 




LDI 


*AR2,R1 


: Load the task address 




CALLU 


Rl 


Execute appropriate task 


* 


BR 


WAIT 




TSKSEQ .word 


TASK5 


? Address of TASK 5 




.word 


TASK4 


; Address of TASK4 




.word 


TASK 3 


\ Address of TASK3 




.word 


TASK 2 


f Address of TASK 2 




.word 


TASKl 


; Address of TASKl 




.word 


TASKO 


) Address of TASKO 


ADDR 


.word 


TSKSEQ 
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11.3 Logical and Arithmetic Operations 

The TMS320C3x instruction set supports both integer and floating-point arith- 
metic and logical operations. The basic functions of such instructions can be 
combined to form more complex operations. This section examines examples 
of these operations: 

□ Bit manipulation 

□ Block moves 

□ Bit-reversed addressing 

□ Integer and floating-point division 

□ Square root 

□ Extended-precision arithmetic 

□ Floating-point format conversion between IEEE and TMS320C3x formats 



11.3.1 Bit Manipulation 

Instructions for logical operations, such as AND, OR, NOT, ANDN, and XOR 
can be used together with the shift instructions for bit manipulation. A special 
instruction, TSTB, tests bits. TSTB performs the same operation as AND, but 
the result of the logical AND is only used to set the condition flags and is not 
written anywhere. Example 11-12 and Example 11-13 demonstrate the use 
of the several instructions for bit manipulation and testing. 



Example 1 1-12. Use of TSTB for Software-Controlled Interrupt 

* TITLE USE OF TSTB FOR SOFTWARE-CONTROLLED INTERRUPT 

* IN THIS EXAMPLE, ALL INTERRUPTS HAVE BEEN DISABLED BY 

* RESETTING THE GIE BIT OF THE STATUS REGISTER. WHEN AN 

* INTERRUPT ARRIVES, IT IS STORED IN THE IF REGISTER. THE 

* PRESENT EXAMPLE ACTIVATES THE INTERRUPT SERVICE ROUTINE INTR 

* WHEN IT DETECTS THAT INT2- HAS OCCURRED. 



TSTB 0100b, IF ; Check if bit 2 of IF is set, 
CALLNZ INTR ; and, if so, call subroutine INTR 
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Example 1 1-13. Copy a Bit From One Location to Another 

* TITLE COPY A BIT FROM ONE LOCATION TO ANOTHER 



BIT I OF Rl NEEDS TO BE COPIED TO BIT J OF R2 . 

ARO POINTS TO A LOCATION HOLDING I, AND IT IS ASSUMED THAT THE 
NEXT MEMORY LOCATION HOLDS THE VALUE J. 



J 
1 



RX 



R2 



*AR0 



*(AR0+1) 



LDI 


1,R0 




LSH 


*AR0,R0 , 


Shift 1 to align it with bit I 


TSTB 


R1,R0 


} Test the Ith bit of Rl 


BZD 


CONT , 


If bit = 0, branch delayed 


LDI 


1,R0 




LSH 


*+AR0(l),R0 , 


; Align 1 with Jth location 


ANDN 


R0,R2 , 


f If bit * 0, reset Jth bit of R2 


OR 


R0,R2 


f If bit - 1, set Jth bit of R2 



CONT 
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11.3.2 Block Moves 

Since the TMS320C3x directly addresses a large amount of memory, blocks 
of data or program code can be stored off-chip in slow memories and then 
loaded on-chip for faster execution. Data can also be moved from on-chip to 
off-chip memory for storage or for multiprocessor data transfers. 

You can use direct memory access (DMA) in parallel with CPU operations to 
accomplish such data transfers. The DMA operation is explained in detail in 
subsection 8.3 on page 8-43. An alternative to DMA is to perform data trans- 
fers under program control using load and store instructions in a repeat mode. 
Example 11-14 shows the transfer of a block of 512 floating-point numbers 
from external memory to block 1 of the on-chip RAM. 

Example 11-14. Block Move Under Program Control 

* TITLE BLOCK MOVE UNDER PROGRAM CONTROL 
★ 

extern .word 01000H 
blockl .word 0809C00H 



LDI 


@ extern, ARO 


; Source address 


LDI 


Gblockl, AR1 


; Destination address 


LDF 


*AR0++,R0 


; Load the first number 


RPTS 


510 


f Repeat following instruction 511 times 


LDF 


*AR0++,R0 


; Load the next number, and... 


STF 


R0,*AR1++ 


f store the previous one 


STF 


R0,*AR1 


; Store the last number 



1 1 .3.3 Bit-Reversed Addressing 

The TMS320C3x can implement fast Fourier transforms (FFT) with bit-rev- 
ersed addressing. If the data to be transformed is in the correct order, the final 
result of the FFT is scrambled in bit-reversed order. To recover the frequency- 
domain data in the correct order, you must swap certain memory locations. 
The bit-reversed addressing mode makes swapping unnecessary. The next 
time data needs to be accessed, the access is performed in a bit-reversed 
manner rather than sequentially. The base address of bit-reversed addressing 
must be located on a boundary of the size of the table. For example, if IRO = 
2 n ~ 1 , the n LSBs of the base address must be 0. 
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In bit-reversed addressing, IRO holds a value equal to one-half the size of the 
FFT, if real and imaginary data are stored in separate arrays. During access- 
ing, the auxiliary register is indexed by IRO, but with reverse carry propagation. 
Example 11-15 illustrates a 512-point complex FFT being moved from the 
place of computation (pointed at by ARO) to a location pointed at by AR1 . In 
this example, real and imaginary parts XR(i) and XI (i) of the data are not stored 

in separate arrays, but they are interleaved XR(0), Xl(0), XR(1), Xl(1) 

XR(N-1), XI(N-1). Because of this arrangement, the length of the array is 2N 
instead of N, and IRO is set to 512 instead of 256. 



Example 11-15. Bit-Reversed Addressing 



* TITLE BIT-REVERSED ADDRESSING 
* 

* THIS EXAMPLE MOVES THE RESULT OF THE 512-POINT FFT 

* COMPUTATION POINTED AT BY ARQ TO A LOCATION POINTED A1 

* BY ARl. REAL AND IMAGINARY POINTS ARE ALTERNATING. 



LDI 


512, IRO 




LDI 


2,IR1 




LDI 


511, RC ; 


Repeat 511+1 times 


LDF 


*+AR0(l),Rl 


Load first imaginary point 


RPTB 

* 


LOOP 




LDF 


*AR0++(IR0)B,R0 ; 


Load real value (and point 


| | STF 


R1,*+AR1(1) : 


to next location) and store 


* 




the imaginary value 


LOOP LDF 


*+AR0(l),Rl } 


Load next imaginary point and store 


| | STF 


R0,*AR1++(IR1) ? 


previous real value 



11.3.4 Integer and Floating-Point Division 

Although division is not implemented as a single instruction in the 
TMS320C3x, the instruction set has the capacity to perform an efficient divi- 
sion routine. Integer and floating-point division are examined separately be- 
cause different algorithms are used. 
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11.3.4.1 Integer Division 

Division is implemented on the TMS320C3x by repeated subtractions using 
SUBC, a special conditional subtract instruction. Consider the case of a 32-bit 
positive dividend with i significant bits (and 32 - i sign bits) and a 32-bit positive 
divisor with j significant bits (and 32 - j sign bits). The repetition of the SUBC 
command i - j + 1 times produces a 32-bit result in which the lower i - j + 
1 bits are the quotient and the upper 31 - i + j bits are the remainder of the 
division. 

SUBC implements binary division in the same manner that long division imple- 
ments it. The divisor which is assumed to be smaller than the dividend) is 
shifted left i - j times to be aligned with the dividend. Then, using SUBC, the 
shifted divisor is subtracted from the dividend. For each subtraction that does 
not produce a negative answer, the dividend is replaced by the difference. It 
is then shifted to the left, and a 1 is put in the LSB. If the difference is negative, 
the dividend is simply shifted left by 1. This operation is repeated 
i - j + 1 times. 
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As an example, consider the division of 33 by 5, using both long division and 
the SUBC method. In this case, i = 6, j = 3, and the SUBC operation is repeated 
6-3 + 1=4 times. 

Long Division: 

I 00000000000000000000000000000110 _ 4 . 4 

— - Quotient 

000000000000000000000000000001 01 | 000000000000000000000000001 00001 

-101 

1101 

-101 

1 1 Remainder 



SUBC Method: 



00000000000000000000000000100001 
000000000000000000000000001 01 000 

Negative Difference 
i 

000000000000000000000000001 0001 0 
000000000000000000000000001 01 000 

00000000000000000000000000011 01 0 

I 

0000000000000000000000000011 01 01 
000000000000000000000000001 01 000 

00000000000000000000000000001101 

i 

00000000000000000000000000011 01 1 
000000000000000000000000001 01 000 

Negative Difference 

i 

0000000000000000000000000011 0110 

1 



Remainder 



1 

Quot. 



Dividend 
Divisor (Aligned) 
(First SUBC Command) 



New Dividend + Quotient 
Divisor 

Difference (> 0) (Second SUBC Command) 



New Dividend + Quotient 
Divisor 

Difference (> 0) (Third SUBC Command) 



New Dividend + Quotient 
Divisor 

(Fourth SUBC Command) 



Final Result 



When the SUBC command is used, both the dividend and the divisor must be 
positive. Example 11-16 shows an example of a realization of the integer divi- 
sion in which the sign of the quotient is properly handled. The last instruction 
before returning modifies the condition flag in case subsequent operations de- 
pend on the sign of the result. 
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Example 11-16. Integer Division 



* TITLE INTEGER DIVISION 
* 

SUBROUTINE DIVI 

* 
* 

* INPUTS: SIGNED INTEGER DIVIDEND IN RO, 

* SIGNED INTEGER DIVISOR IN Rl 
* 

* OUTPUT: R0/R1 into RO 

* REGISTERS USED: R0-R3, IRO, IRl 
★ 

* OPERATION: 1. NORMALIZE DIVISOR WITH DIVIDEND 

* 2. REPEAT SUBC 

* 3. QUOTIENT IS IN LSBs OF RESULT 
* 

* CYCLES: 31-62 (DEPENDS ON AMOUNT OF NORMALIZATION) 

.globl DIVI 

SIGN .set R2 

TEMPF . set R3 

TEMP . set IRO 

COUNT . set IRl 

* DIVI - SIGNED DIVISION 



DIVI: 

* DETERMINE SIGN OF RESULT. GET ABSOLUTE VALUE OF OPERANDS. 
* 



XOR R0,R1,SIGN ; Get the sign 
ABSI RO 
ABSI Rl 



CMPI R0,R1 ; Divisor > dividend ? 

BGTD ZERO ; If so, return 0 

★ 



* NORMALIZE OPERANDS. USE DIFFERENCE IN EXPONENTS AS SHIFT COUNT 

* FOR DIVISOR AND AS REPEAT COUNT FOR 'SUBC. 



FLOAT RO, TEMPF 
PUSHF TEMPF 
POP COUNT 
LSH-24,COUNT 



Normalize dividend 
PUSH as float 
POP as int 

Get dividend exponent 
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FLOAT 

PUSHF 

POP 

LSH 

SUBI 

LSH 



R1,TEMPF 

TEMPF 

TEMP 

-24 , TEMP 
TEMP , COUNT 
COUNT, Rl 



Normalize divisor 
PUSH as float 
POP as int 

Get divisor exponent 

Get difference in exponents 

Align divisor with dividend 



DO COUNT+1 SUBTRACT & SHIFTS. 



RPTS 
SUBC 



COUNT 
R1,R0 



MASK OFF THE LOWER COUNT+1 BITS OF RO . 



SUBRI 31 .COUNT 

LSH COUNT , RO 

NEGI COUNT 

LSH COUNT, RO 



Shift count is (32 - (COUNT+1)) 
Shift left 

Shift right to get result 



CHECK SIGN AND NEGATE RESULT IF NECESSARY. 



NEGI R0,R1 

ASH -31, SIGN 

LDINZ R1,R0 

CMPI 0,R0 
RETS 



Negate result 
Check sign 

If set, use negative result 
Set status from result 



RETURN 0. 



LDI 
RETS 
• end 



0,R0 



If the dividend is less than the divisor and you want fractional division, you can 
perform a division after you determine the desired accuracy of the quotient in 
bits. If the desired accuracy is k bits, start by shifting the dividend left by k posi- 
tions. Then apply the algorithm described above, with i replaced by i + k. It is 
assumed that i + k is less than 32. 



11-30 



Logical and Arithmetic Operations 



11.3.4.2 Computation of Floating-Point Inverse and Division 

This section presents a method of implementing floating-point division on the 
TMS320C3x. Since the algorithm outlined here computes the inverse of a 
number v, to perform y / v, multiply y by the inverse of v. 

The computation of 1 / v is based on the following iterative algorithm. At the 
ith iteration, the estimate x [i] of 1 / v is computed from v and the previous esti- 
mate x [i-1] according to the following formula: 

x[i] = x[i-1]*(2.0-v*x[i-1]) 

To start the operation, an initial estimate x [0] is needed. If v = a * 2 e , a good 
initial estimate is 

x[0] = 1.0*2~e- 1 

Example 11-17 shows the implementation of this algorithm on the 
TMS320C3x, where the iteration has been applied five times. Both accuracy 
and speed are affected by the number of iterations. The accuracy offered by 
the single-precision floating-point format is 2 - 23 = 1 .192E - 7. If you want 
more accuracy, use more iterations. If you want less accuracy, reduce the 
number of iterations to increase the execution speed. 

This algorithm properly treats the boundary conditions when the input number 
either is 0 or has a very large value. When the input is 0, the exponent 
e =-128. Then the calculation of x[0] yields an exponent equal to 
- (- 128) -1 = 127, and the algorithm will overflow and saturate. On the other 
hand, in the case of a very large number, e = 1 27, the exponent of x [0] will be 
- 1 27 - 1 = - 1 28. This will cause the algorithm to yield 0, which is a reasonable 
handling of that boundary condition. 
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Example 11-17, Inverse of a Floating-Point Number 
* 

* TITLE INVERSE OF A FLOATING-POINT NUMBER 

* SUBROUTINE INVF 

* THE FLOATING-POINT NUMBER V IS STORED IN RO . AFTER THE 

* COMPUTATION IS COMPLETED , 1/v IS ALSO STORED IN RO . 
* 

* TYPICAL CALLING SEQUENCE: 

* LDFv, RO 

* CALL INVF 
* 



* ARGUMENT ASSIGNMENTS: 

* ARGUMENT | FUNCTION 

* ^ 

* RO 

* RO 



V - NUMBER TO FIND THE RECIPROCAL OF (UPON THE CALL) 
1/v (UPON THE RETURN) 

* REGISTER USED AS INPUT: RO 

* REGISTERS MODIFIED: RO, Rl, R2, R3 

* REGISTER CONTAINING RESULT: RO 

* CYCLES: 35 WORDS: 32 



.global INVF 

* 

INVF: LDF RO , R3 ; v is saved for later 

ABSF RO ; The algorithm uses v * |v| 

* 

* EXTRACT THE EXPONENT OF V. 

PUSHF RO 

POP Rl 

ASH -24,R1 ; The 8 LSBs of Rl contain the exponent 

* ; of v 



x[0] FORMATION IS GIVEN THE EXPONENT OF V, 
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NEGI 
SUBI 
ASH 
PUSH 
POPF 



Rl 

1,R1 
24, Rl 
Rl 
Rl 



; NOW Rl = X[0] 
NOW THE ITERATIONS BEGIN. 



Now we have -e-1, the exponent of x[0] 
1.0 * 2**(-e-l) 



MPYF 


Rl ,R0,R2 


; R2 


as 


v * x[ 0 ] 








SUBRF 


2.0,R2 


; R2 




2.0 - v * x[0] 








MPYF 


R2 ,R1 


} Rl 




x[l] ■ x[0] * (2.0 


- V 


* 


x[0]) 


MPYF 


Rl, R0,R2 


; R2 




v * x[l] 








SUBRF 


2.0,R2 , 


\ R2 


as 


2.0 - v * x[l] 








MPYF 


R2,R1 


} Rl 




x[2] = x[l] * (2.0 


- V 


* 


x[l]) 


MPYF 


R1,R0,R2 , 


; R2 




v * x[2] 








SUBRF 


2.0,R2 


\ R2 




2.0 - v * x[2] 








MPYF 


R2,R1 , 


» Rl 




x[3] * x[2] * (2.0 


- V 


* 


x[2]) 


MPYF 


R1,R0,R2 , 


\ R2 




v * x[3] 








SUBRF 


2.0,R2 , 


■ R2 




2.0 - v * x[3] 








MPYF 


R2,R1 < 


» Rl 


m 


x[4] » x[3] * (2.0 


- V 


* 


x[3]) 


RND 


Rl ; This 


minimizes error in 


the 


LSBs 



FOR THE LAST ITERATION WE USE THE FORMULATION 2 
x[5] = (x[4] * (1.0 - (v * x[4]))) + x[4] 



MPYF R1,R0,R2 

SUBRF 1.0, R2 

MPYF R1,R2 

ADDF R2,R1 



=> 0 



R2 - v * x[4] « 1.0. .01. . «> 1 

R2 m i.o - v * x[4] - 0.0.. 01.. 

R2 - x[4] * (1.0 - v * x[4] ) 

R2 - x[5] * (x[4]*(1.0-(v*x[4] ) ) )+x[4] 



RNDRl ,R0 ; Round since this is followed by a MPYF 

NOW THE CASE OF V < 0 IS HANDLED. 



* 

END 



NEGF 

LDF 

LDFN 

RETS 



• end 



R0,R2 
R3,R3 
R2,R0 



This sets condition flags 
If v < 0, then R0 * -R0 
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11.3.5 Square Root 

An iterative algorithm computes square root on the TMS320C3x and is similar 
to the one used for the computation of the inverse. This algorithm computes 
the inverse of the square root of a number v, 1 / SQRT(v). To derive SQRT(v), 
multiply this result by v. Since in many applications, division by the square root 
of a number is desirable, the output of the algorithm saves the effort to compute 
the inverse of the square root. 

At the ith iteration, the estimate x[i] of 1 / SQRT(v) is computed from v and the 
previous estimate x[i-1] according to this formula: 

x[i]=x[i~1]*(1.5-(v/2)*x[i-1]*x[i-1]) 

To start the operation, an initial estimate x[0] is needed. If v = a * 2 e , a good 
initial estimate is 

x[0] = 1.0*2~ e / 2 

Example 11-18 shows the implementation of this algorithm on the 
TMS320C3x, where the iteration has been applied five times. Both accuracy 
and speed are affected by the number of iterations. If you want more accuracy, 
use more iterations. If you want less accuracy, reduce the number of iterations 
to increase the execution speed. 
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Example 1 1-18. Square Root of a Floating-Point Number 
* 

* TITLE SQUARE ROOT OF A FLOATING-POINT NUMBER 
* 

* 

* SUBROUTINE SQRT 
* 

* THE FLOATING POINT NUMBER V IS STORED IN RO . AFTER THE 

* COMPUTATION IS COMPLETED, SQRT(v) IS ALSO STORED IN RO . NOTE 

* THAT THE ALGORITHM ACTUALLY COMPUTES 1/SQRT(V) . 



* 



TYPICAL CALLING SEQUENCE: 



* LDF v, RO 

* CALL SQRT 

* ARGUMENT ASSIGNMENTS: 

* ARGUMENT | FUNCTION 

* •{ 

* RO 



RO 



V - NUMBER TO FIND THE SQUARE ROOT OF 
(UPON THE CALL) 
SQRT(V) (UPON THE RETURN) 



* REGISTER USED AS INPUT: RO 

* REGISTERS MODIFIED: RO, Rl, R2 , R3 

* REGISTER CONTAINING RESULT: RO 

* CYCLES: 50 WORDS: 39 



.global SQRT 



* EXTRACT THE EXPONENT OF V. 
* 
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SQRT : LDFR0,R3 
RETSLE 
PUSHF RO 
POPR1 
ASH-24,R1 
ADD I 1,R1 
ASH -1,R1 



Save v 

Return if number is non-positive 



The 8 LSBs of Rl contain exponent of v 
Add a rounding bit in the exponent 
e/2 



X[0] FORMATION GIVEN THE EXPONENT OF V. 



NEGI 
ASH 
PUSH 
POPF 



Rl 

24, Rl 

Rl 

Rl 



Now Rl - x[G] ■ 1.0 * 2**(-e/2) 



* 
* 


GENERATE 


V/2. 






■it 


MPYF 


0.5,R0 


} V/2 


and talce roundincf bit out 


* 

it 


NOW THE ITERATIONS BEGIN. 






MPYF 


R1,R1,R2 


f R2 


- x[0] * x[0] 




MPYF 


R0,R2 


f R2 


- (v/2) * x[0] * x[0] 




SUBRF 


1.5,R2 


; R2 


- 1.5 - (v/2) * x[0] * x[0] 




MPYF 


R2,R1 t 


: Rl 


= x[l] = x[0] * 


* 








(1.5 - (v/2)*x[0]*x[0]) 




RND 


Rl 








MPYF 


R1,R1,R2 , 


• R2 


- x[l] * x[l] 




MPYF 


R0,R2 4 


I R2 


- (v/2) * x[l] * x[l] 




SUBRF 


1.5,R2 


f R2 


* 1.5 - (v/2) * x[l] * x[l] 




MPYF 


R2,R1 t 


• Rl 


« x[2] - x[l] * 


* 








(1.5 - (v/2)*x[l]*x[l]) 




RND 


Rl 








MPYF 


R1,R1,R2 , 


• R2 


» x[2] * x[2] 




MPYF 


RO ,R2 t 


; R2 


- (v/2) * x[2] * x[2] 




SUBRF 


1.5,R2 


; R2 


- 1.5 - (v/2) * x[2] * x[2] 




MPYF 


R2 ,R1 , 


• Rl 


- x[3] - x[2] 


it 








*(1.5 - (v/2)*x[2]*x[2J ) 




RND 


Rl 
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MPYF 


Rl ,R1 ,R2 


; R2 




x[3] 


* x[3] 






MPYF 


RO / R2 


R2 


as 




* x r 3 1 


* Yf?l 
X l J J 




SUBRF 


1 . 5 , R2 


R2 


SB 


1.5 - 


(xr/7\ * 




ic r n 

X.I 3 J 


MPYF 


R2 ,R1 


Rl 




v r 4 1 


A[ O J 












* 


t\ 5 

I 1 • D 




X[ J J 




RND 


Rl 














MPYF 


R1,R1,R2 


; R2 


S 


x[4] 


* x[4] 






MPYF 


R0,R2 , 


f R2 




(v/2) 


* x[4] 


* x[4] 




SUBRF 


1.5,R2 t 


• R2 


s 


1.5 - 


(v/2) * 


x[4] * 


x[4] 


MPYF 


R2 ,R1 , 


• Rl 




x[5] 


- x[4] 












* 


(1.5 


- (v/2) 


* x[4] 


* x[4]) 



end 



RND R1,R0 
MPYF R3,R0 
RETS 



Round 

Sqrt(v) from sqrt (v** (-1 ) ) 



.end 
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11.3.6 Extended-Precision Arithmetic 

The TMS320C3x offers 32 bits of precision for integer arithmetic and 24 bits 
of precision in the mantissa for floating-point arithmetic. For higher precision 
in floating-point operations, the eight extended-precision registers R7 to RO 
contain eight additional bits of accuracy. Since no comparable extension is 
available for fixed-point arithmetic, this section shows how you can achieve 
fixed-point double precision by using the capabilities of the processor. The 
technique consists of performing the arithmetic by parts (which is similar to 
performing longhand arithmetic). 

In the instruction set, operations ADDC (add with carry) and SUBB (subtract 
with borrow) use the status carry bit for extended-precision arithmetic. The 
carry bit is affected by the arithmetic operations of the ALU and by the rotate 
and shift instructions. It can also be manipulated directly by setting the status 
register to certain values. For proper operation, the overflow mode bit should 
be reset (OVM = 0) so that the accumulator results are not loaded with the sat- 
uration values. Example 11-19 and Example 11-20 show 64-bit addition and 
64-bit subtraction. The first operand is stored in the registers RO (low word) and 
R1 (high word). The second operand is stored in R2 and R3. The result is 
stored in RO and R1 . 
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Example 11-19. 64-Bit Addition 



TITLE 64-BIT ADDITION 

TWO 64-BIT NUMBERS ARE ADDED TO EACH OTHER, PRODUCING 

A 64-BIT RESULT. THE NUMBERS X (R1,R0) AND Y (R3,R2) ARE 

ADDED, RESULTING IN W (R1,R0). 

Rl RO 
+ R3 R2 

Rl RO 



ADDI R2,R0 
ADDC R3,R1 



Example 1 1-20. 64-Bit Subtraction 



TITLE 6 4 -BIT SUBTRACTION 

TWO 64-BIT NUMBERS ARE SUBTRACTED FROM EACH OTHER 
PRODUCING A 64-BIT RESULT. THE NUMBERS X ( Rl ,R0 ) AND 
Y (R3,R2) ARE SUBTRACTED, RESULTING IN W (R1,R0). 



Rl RO 
- R3 R2 

Rl RO 



SUBI 
SUBB 



R2,R0 
R3,R1 



When two 32-bit numbers are multiplied, a 64-bit product results. The proce- 
dure for multiplication is to split the 32-bit magnitude values of the multiplicand 
X and the multiplier Y into two parts (X1 ,X0) and (X3,X2), respectively, with 1 6 
bits each. The operation is done on unsigned numbers, and the product is ad- 
justed for the sign bit. Example 11-21 shows the implementation of a 32-bit by 
32-bit multiplication. 
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Example 11-21. 32-Bit-by-32-Bit Multiplication 

* 

* TITLE 32 BIT X 32 BIT MULTIPLICATION 
* 

* 

* SUBROUTINE EXTMPY 
* 

* FUNCTION: TWO 32-BIT NUMBERS ARE MULTIPLIED, PRODUCING A 64-BIT 

* RESULT, THE TWO NUMBERS (X and Y) ARE EACH SEPARATED INTO TWO 

* PARTS (XI XO) AND (Yl YO ) , WHERE XO, XI, YO, AND Yl ARE 16 BITS. 

* THE TOP BIT IN XI AND Yl IS THE SIGN BIT. THE PRODUCT IS 

* IN TWO WORDS (WO AND Wl ) . THE MULTIPLICATION IS PERFORMED ON 

* POSITIVE NUMBERS, AND THE SIGN IS DETERMINED AT THE END. 

* 
* 



XI XO BITS OF PRODUCTS 

Yl YO (NOT COUNTING SIGN) PRODUCT 



* X0*Y0 16+16 PI 

* X0*Y1 16+16 P2 

* Xl*Y0 16+16 P3 

* X1*Y1 16+16 P4 

* 

* Wl WO 



ARGUMENT ASSIGNMENTS : 
ARGUMENT | FUNCTION 



RO 
Rl 



MULTIPLIER AND LOW WORD OF THE PRODUCT 
MULTIPLICAND AND UPPER WORD OF THE PRODUCT 



REGISTERS USED AS INPUT: RO, Rl 

REGISTERS MODIFIED: RO, Rl, R2, R3, R4, ARO, AR1 
REGISTER CONTAINING RESULT: R0,Rl 
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CYCLES: 28 (WORST CASE) WORDS: 25 
.global EXTMPY 



EXTMPY 



XOR3 
ABSI 
ABSI 



R0,R1,AR0 

RO 

Rl 



Store sign 

Absolute values of X 
and Y 



SEPARATE MULTIPLIER AND MULTIPLICAND INTO TWO PARTS 



LDI 

LSH3 

AND 

LSH3 

AND 



-16 , ARl 
ARl ,R0 ,R2 
0FFFFH,R0 
ARl ,R1 ,R3 
0FFFFH,R1 



R2 - XI 

RO = XO 
R3 - Yl 
Rl - YO 



CARRY OUT THE MULTIPLICATION 



upper 16 bits of X 

lower 16 bits of X 

upper 16 bits of Y 

lower 16 bits of Y 



MPYI3 


R0,R1,R4 


} X0*Y0 * PI 


MPYI 


R3 ,R0 


f X0*Y1 - P2 


MPYI 


R2,R1 


} X1*Y0 = P3 


ADD I 


R0,R1 


} P2+P3 


MPYI 


R2 ,R3 


) Xl*Yl - P4 


LDI 


R1,R2 




LSH 


16, R2 


; Lower 16 bits of P2+P3 


CMPI 


0 , ARO 


} Check the sign of the product 


BGED 


DONE 


f If >0, multiplication complete 






f (delayed) 


LSH 


-16, Rl 


f Upper 16 bits of P2+P3 


ADD 1 3 


R4,R2,R0 


; WO * RO ■ lower word of the product 


ADDC3 


Rl,R3,Rl 


) Wl * Rl « upper word of the product 



* NEGATE THE PRODUCT IF THE NUMBERS ARE OF OPPOSITE SIGNS 
* 

NOTRO 

ADD I 1,R0 
NOTR1 

AD DC 0,R1 

DONE RETS 
• end 
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11.3.7 IEEE/TMS320C3x Floating-Point Format Conversion 



The fast version of the IEEE-to-TMS320C3x conversion routine was originally 
developed by Keith Henry of Apollo Computer, Inc. The other routines were 
based on this initial input. 

In fixed-point arithmetic, the binary point that separates the integer from the 
fractional part of the number is fixed at a certain location. For example, if a 
32-bit number has the binary point after the most significant bit (which is also 
the sign bit), only fractional numbers (numbers with absolute values less than 
1), can be represented. In other words, there is a number called a Q31 number, 
which is a number with 31 fractional bits. All operations assume that the binary 
point is fixed at this location. The fixed-point system, although simple to imple- 
ment in hardware, imposes limitations in the dynamic range of the represented 
number, which causes scaling problems in many applications. You can avoid 
this difficulty by using floating-point numbers. 

A floating-point number consists of a mantissa m multiplied by base b raised 
to an exponent e: 

m*b e 

In current hardware implementations, the mantissa is typically a normalized 
number with an absolute value between 1 and 2, and the base is b = 2. Al- 
though the mantissa is represented as a fixed-point number, the actual value 
of the overall number floats the binary point because of the multiplication by 
b e . The exponent e is an integer whose value determines the position of the 
binary point in the number. IEEE has established a standard format for the re- 
presentation of floating-point numbers. 

To achieve higher efficiency in hardware implementation, the TMS320C3x 
uses a floating-point format that differs from the IEEE standard. This section 
briefly describes the two formats and presents software routines to convert be- 
tween them. 

TMS320C3X floating-point format: 

8 1 23 



11-42 



Logical and Arithmetic Operations 



In a 32-bit word representing a floating-point number, the first eight bits corre- 
spond to the exponent expressed in two's-complement format. There is one 
bit for sign and 23 bits for the mantissa. The mantissa is expressed in two's- 
complement form, with the binary point after the most significant nonsign bit. 
Since this bit is the complement of the sign bit s, it is suppressed. In other 
words, the mantissa actually has 24 bits. A special case occurs when 
e = -128. In this case, the number is interpreted as 0, independently of the 
values of s and f (which are set to 0 by default). To summarize, the values of 
the represented numbers in the TMS320C3x floating-point format are as fol- 
lows: 

2©*(01.f) if s = 0 

2©*(10.f) if s = 1 

0 if e = -128 

IEEE floating-point format: 

1 8 23 



e 



f 



The IEEE floating-point format uses sign-magnitude notation for the mantissa, 
and the exponent is biased by 127. In a 32-bit word representing a 
floating-point number, the first bit is the sign bit. The next eight bits correspond 
to the exponent, which is expressed in an offset-by-1 27 format (the actual ex- 
ponent is e-127). The following 23 bits represent the absolute value of the 
mantissa with the most significant 1 implied. The binary point is after this most 
significant 1 . In other words, the mantissa actually has 24 bits. There are sev- 
eral special cases, summarized below. 

These are the values of the represented numbers in the IEEE floating-point 
format: 

(-1) s * 2© -127 * (01.f) if 0 < e < 255 

Special cases: 

(-1) s * 0.0 if e = 0 and f = 0 (zero) 

(-1 ) s * 2 -1 26 * (o.f) if e = 0 and f <> 0 (denormalized) 

(-1 ) s * infinity if e = 255 and f = 0 (infinity) 

NaN (not a number) if e = 255 and f <> 0 

Based on these definitions of the formats, two versions of the conversion rou- 
tines were developed. One version handles the complete definition of the for- 
mats. The other ignores some of the special cases (typically the ones that are 
rarely used), but it has the benefit of executing faster than the complete con- 
version. For this discussion, the two versions are referred to as the complete 
version and the fast version, respectively. 
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11.3.7. 1 IEEE-to-TMS320C3x Floating-Point Format Conversion 

Example 11 -22 shows the fast conversion from IEEE to TMS320C3x floating- 
point format. It properly handles the general case when 0 < e < 255, and also 
handles Os (that is, e = 0 and f = 0). The other special cases (denormalized, 
infinity, and NaN) are not treated and, if present, will give erroneous results. 

Example 11-22. IEEE-to-TMS320C3x Conversion (Fast Version) 

* TITLE IEEE TO TMS320C3x CONVERSION (FAST VERSION) 
* 

* SUBROUTINE FMIEEE 
* 

* FUNCTION: CONVERSION BETWEEN THE IEEE FORMAT AND THE 

* TMS320C3X FLOATING-POINT FORMAT. THE NUMBER TO 

* BE CONVERTED IS IN THE LOWER 32 BITS OF R0 . 

* THE RESULT IS STORED IN THE UPPER 32 BITS OF R0 . 

* UPON ENTERING THE ROUTINE, ARl POINTS TO THE 

* FOLLOWING TABLE: 
* 

* (0) 0xFF800000 < — ARl 

* (1) OxFFOOOOOO 

* (2) 0x7F000000 

* (3)0x80000000 

* (4)0x81000000 
* 



* ARGUMENT ASSIGNMENTS: 

* ARGUMENT I FUNCTION 



R0 
ARl 



NUMBER TO BE CONVERTED 

POINTER TO TABLE WITH CONSTANTS 



* REGISTERS USED AS INPUT: R0 , ARl 

* REGISTERS MODIFIED: R0, Rl 

* REGISTER CONTAINING RESULT: R0 



* 



NOTE: SINCE THE STACK POINTER SP IS USED, MAKE SURE TO 
INITIALIZE IT IN THE CALLING PROGRAM. 



* CYCLES: 12 (WORST CASE) WORDS: 12 



.global FMIEEE 

* 
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FMXEEE 



NEG 
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ASiUo 


n A 4*n 1 D1 

KU / AJKl , Kl 


K@pl.aGe HaCulOn WlLn u 


nMn 
dNU 




Test sign 


ADD I 


R0,R1 , 


Shift sign 






and exponent inserting 0 


T F\ T 17 


*+ARl ( 1 ) fRl 


* ii ail u, generate l.ju u 


CTTDT 


W +AK1 ( Z ; r Kl 


Unbias exponent 


PTIQW 
ruon 


P 1 
fix 




POPF 


RO 


; Load this as a fit. pt. number 


RETS 






PUSH 


Rl 




POPF 


RO 


; Load this as a fit. pt. number 


NEGF 


RO ,R0 


; Negate if orig. sign is negative 


RETS 
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Example 11-23. 

* 
* 
* 
* 
★ 

* 
* 
* 
* 
* 

* 

* 
* 
* 
* 
* 

* 
* 
* 
* 

* 

* 

* 
* 
* 
* 

* 

* 
* 
* 



Example 11-23 shows the complete conversion between the IEEE and 
TMS320C3x formats. In addition to the general case and the Os, it handles the 
special cases as follows: 

□ If NaN (e = 255, f< >0), the number is returned intact. 

□ If infinity (e = 255, f = 0), the output is saturated to the most positive or 
negative number, respectively. 

□ If denormalized (e = 0, f< >0), two cases are considered. If the MSB of 
f is 1 , the number is converted to TMS320C3x format. Otherwise, an un- 
derflow occurs, and the number is set to 0. 

IEEE-to-TMS320C3x Conversion (Complete Version) 

TITLE IEEE TO TMS320C3X CONVERSION (COMPLETE VERSION) 



SUBROUTINE FMIEEEl 



FUNCTION: CONVERSION BETWEEN THE IEEE FORMAT AND THE TMS320C3x 
FLOATING-POINT FORMAT. THE NUMBER TO BE CONVERTED 
IS IN THE LOWER 32 BITS OF RO . THE RESULT IS STORED 
IN THE UPPER 32 BITS OF RO . 



UPON ENTERING THE ROUTINE, ARl POINTS TO THE FOLLOWING TABLE: 



(0) 0xFF8000Q0 < — ARl 

(1) OxFFOOOOOO 

(2) 0x7F000000 

(3) 0x80000000 

(4) 0x81000000 

(5) 0x7F800000 

(6) 0x00400000 

(7) 0x007FFFFF 

(8) 0x7F7FFFFF 



ARGUMENT ASSIGNMENTS : 
ARGUMENT | FUNCTION 



R0 
ARl 



NUMBER TO BE CONVERTED 

POINTER TO TABLE WITH CONSTANTS 



REGISTERS USED AS INPUT: R0 , ARl 
REGISTERS MODIFIED: R0, Rl 
REGISTER CONTAINING RESULT: R0 
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* NOTE: SINCE THE STACK POINTER SP IS USED, MAKE SURE TO 

* INITIALIZE IT IN THE CALLING PROGRAM. 



* CYCLES: 23 (WORST CASE) 
* 

.global FMIEEE1 



WORDS: 34 



LDI RO ,Rl 
*+ARl(5) ,R1 
UNNORM 



FMIEEE1 
AND 
BZ 
★ 

XOR *+ARl(5),Rl 
BNZ NORMAL 



If e = 0, number is either 0 or 
denormalized 

If e < 255, use regular routine 



HANDLE NaN AND INFINITY 



TSTB *+ARl(7),R0 

RETSNZ 

LDI RO ,R0 

LDFGT *+ARl(8),R0 



LDFN *+ARl(5),R0 
RETS 



Return if NaN 

If positive, infinity ■ 
most positive number 

If negative, infinity « 

most negative number RETS 



* HANDLE 0s AND UNNORMALI ZED NUMBERS 



UNNORM 



NEG1 



TSTB 
LDFZ 
RETSZ 

XOR 
BND 
LSH 

SUBI 
PUSH 
POPF 
RETS 
POPF 
NEGF 
RETS 



*+ARl(6) ,R0 
*+ARl(3) ,R0 

*+ARl(6) ,R0 

NEG1 

1,R0 

*+ARl(2) ,R0 

RO 

RO 

RO 

R0,R0 



Is the MSB of f equal to 1? 
If not, force the number to 0 
and return 

If MSB of f = 1, make it 0 

Eliminate sign bit 

& line up mantissa 
Make e = -127 

Put number in floating point format 
If negative, negate RO 
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* HANDLE 
ic 


THE 


REGULAR CASES 




NORMAL 


AND 3 


R0,*AR1,R1 ; 


; Replace fraction with 0 




BND 


NEG ; 


} Test sign 




ADD I 


RO ,R1 ; 


} Shift sign and exponent inserting 0 




SUBI 


*+ARl(2),Rl j 


} Unbias exponent 




PUSH 


Rl 






POPF 


RO ] 


} Load this as a fit. pt. number 




RETS 






NEG 


POPF 


RO j 


} Load this as a fit. pt. number 




NEGF 


R0,R0 | 


} Negate if original sign negative 



RETS 
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11.3.7.2 TMS320C3x-to-IEEE Floating-Point Format Conversion 

The vast majority of the numbers represented by the TMS320C3x 
floating-point format are covered by the general IEEE format and the repre- 
sentation of Os. The only special case is e = -1 27 in the TMS320C3x format; 
this corresponds to a denormalized number in IEEE format. It is ignored in the 
fast version, while it is treated properly in the complete version. 
Example 11-24 shows the fast version, and Example 11-25 shows the com- 
plete version of the TMS320C3x-to-IEEE conversion. 



Example 11-24. TMS320C3x-to-IEEE Conversion (Fast Version) 
* 

* TITLE TMS320C3X TO IEEE CONVERSION (FAST VERSION) 
* 

* 

* SUBROUTINE TOIEEE 
* 

* FUNCTION: CONVERSION BETWEEN THE TMS320C3X FORMAT AND THE IEEE 

* FLOATING-POINT FORMAT. THE NUMBER TO BE CONVERTED 

* IS IN THE UPPER 32 BITS OF RO . THE RESULT WILL BE IN 

* THE LOWER 32 BITS OF RO . 



* UPON ENTERING THE ROUTINE, ARl POINTS TO THE FOLLOWING TABLE: 

* (0) 0xFF800000 < — ARl 

* (1) OxFFOOOOOO 

* (2) 0x7F000000 

* (3)0x80000000 

* (4)0x81000000 



ARGUMENT ASSIGNMENTS : 
ARGUMENT | FUNCTION 



R0 
ARl 



NUMBER TO BE CONVERTED 

POINTER TO TABLE WITH CONSTANTS 



* REGISTERS USED AS INPUT: R0 # ARl 

* REGISTERS MODIFIED: R0 

* REGISTER CONTAINING RESULT: R0 



* NOTE: SINCE THE STACK POINTER 'SP' IS USED, MAKE SURE TO 

* INITIALIZE IT IN THE CALLING PROGRAM. 

* 
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* CYCLES: 14 (WORST CASE) WORDS: 15 
* 



. global 


TOIEEE 




LDF 


RO ,R0 


; Determine the sign of the number 


LDFZ 


*+ARl(4) ,R0 


; If 0, load appropriate number 


BND 


NE6 


; Branch to NEG if negative (delayed) 


ABSF 


RO 


; Take the absolute value of the number 


LSH 


1, RO 


• Eliminate the pi ion hit in RO 


PUSHF 


RO 




POP 


RO 


} Place number in lower 32 bits of RO 


ADDI 




• Add exnonent bias fl27l 


LSH 


-1^ RO 


; Add the positive sign 


RETS 






POP 


RO 


; Place number in lower 32 bits 






; of RO 


ADDI 


*+ARl(2) ,R0 


; Add exponent bias (127) 


LSH 


— 1 r RO 


• Make space for the sign 


ADDI 


*+ARl(3) ,R0 


; Add the negative sign 



RETS 
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Example 11-25. TMS320C3x-to-IEEE Conversion (Complete Version) 
* 

* TITLE TMS320C3X TO IEEE CONVERSION (COMPLETE VERSION) 



* 

* SUBROUTINE TOIEEEl 



FUNCTION: CONVERSION BETWEEN THE TMS320C3X FORMAT AND THE IEEE 
FLOATING-POINT FORMAT. THE NUMBER TO BE CONVERTED 
IS IN THE UPPER 32 BITS OF RO . THE RESULT WILL BE 
IN THE LOWER 32 BITS OF RO . 



* UPON ENTERING THE ROUTINE, ARl POINTS TO THE FOLLOWING TABLE: 
* 



* (0) 0xFF800000 < — ARl 

* (1) OxFFOOOOOO 

* (2) 0x7F000000 

* (3)0x80000000 

* (4)0x81000000 

* (5) 0x7F800000 

* (6)0x00400000 

* (7) 0X007FFFFF 

* (8) 0X7F7FFFFF 



ARGUMENT ASSIGNMENTS : 
ARGUMENT | FUNCTION 



R0 
ARl 



NUMBER TO BE CONVERTED 

POINTER TO TABLE WITH CONSTANTS 



* REGISTERS USED AS INPUT: R0, ARl 

* REGISTERS MODIFIED: R0 

* REGISTER CONTAINING RESULT: R0 
* 

* NOTE: SINCE THE STACK POINTER 'SP' IS USED, MAKE SURE TO 

* INITIALIZE IT IN THE CALLING PROGRAM. 
* 

* 

* CYCLES: 31 (WORST CASE) WORDS: 25 
* 

.global TOIEEEl 
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TOIEEEl 



CONT 



NEG 



LDF 


RO, RO 


LDFZ 


*+ARl ( 4 ) , RO 


BND 


NEG 


ABSF 


RO 


LSH 


1,R0 


PUSHF 


RO 


POP 


RO 


ADD I 


*+ARl(2) ,R0 


LSH 


-1,R0 


TSTB 


*+ARl ( 5 ) , RO 


RETSNZ 




TSTB 


*+ARl ( 7 ) , RO 


RETSZ 




PUSH 


RO 


POPF 


RO 


LSH 


-1,R0 


PUSHF 


RO 


POP 


RO 


ADD I 


*+ARl(6) ,R0 


RETS 




POP 


RO 


BRD 


CONT 


ADD I 


*+ARI(2) ,R0 


LSH 


-1,R0 


ADD I 


*+ARl(3) ,R0 


RETS 





Determine the sign of the number 
If 0, load appropriate number 
Branch to NEG if negative (delayed) 
Take the absolute value 

of the number 
Eliminate the sign bit in RO 

Place number in lower 32 bits of RO 
Add exponent bias (127) 
Add the positive sign 

If e > 0, return 
lfe*0&f*0, return 

Shift f right by one bit 

Add 1 to the MSB of f 

Place number in lower 32 bits of RO 

Add exponent bias (127) 
Make space for the sign 
Add the negative sign 
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11.4 Application-Oriented Operations 

Certain features of the TMS320C3x architecture and instruction set facilitate 
the solution of numerically intensive problems. This section presents exam- 
pies of applications using these features, such as companding, filtering, FFTs, 
and matrix arithmetic. 

11.4.1 Companding 

In telecommunications, conserving channel bandwidth while preserving 
speech quality is a primary concern. This is achieved this by quantizing the 
speech samples logarithmically. An 8-bit logarithmic quantizer produces 
speech quality equivalent to a 1 3-bit uniform quantizer. The logarithmic quanti- 
zation is achieved by companding (COMpress/exPANDing). Two international 
standards have been established for companding: the n-law standard (used 
in the United States and Japan), and the A-law standard (used in Europe). De- 
tailed descriptions of n law and A law companding are presented in an applica- 
tion report on companding routines included in the book Digital Signal Pro- 
cessing Applications with the TMS320 Family (literature number SPRA01 2A). 

During transmission, logarithmically compressed data in sign-magnitude form 
is transmitted along the communications channel. If any processing is neces- 
sary, you should expand this data to a 14-bit (for [i law) or 13-bit (for A law) 
linear format. This operation is performed when the data is received at the digi- 
tal signal processor. After processing, the result is compressed back to 8-bit 
format and transmitted through the channel to continue transmission. 

Example 11-26 and Example 11-27 show n-law compression and expansion 
(that is, linear to i^-law and pi-law to linear conversion), while Example 11-28 
and Example 11-29 show A-law compression and expansion. For expansion, 
using a look-up table is an alternative approach. A look-up table trades 
memory space for speed of execution. Since the compressed data is eight bits 
long, you can construct a table with 256 entries containing the expanded data. 
If the compressed data is stored in the register ARO, the following two instruc- 
tions will put the expanded data in register RO: 

ADD I @TABL, ARO ; @TABL - BASE ADDRESS OF TABLE 
LDI*AR0,R0 ; PUT EXPANDED NUMBER IN RO 

You could use the same look-up table approach for compression, but the re- 
quired table length would then be 16,384 words for ^i-law or 8,192 words for 
A-law. If this memory size is not acceptable, use the subroutines presented in 
Example 11-26 or Example 11-28. 



Software Applications 1 1 -53 



Application-Oriented Operations 



Example 11-26. \i-Law Compression 



TITLE U-LAW COMPRESSION 



SUBROUTINE MUCMPR 



ARGUMENT ASSIGNMENTS : 
ARGUMENT | FUNCTION 



RO 



NUMBER TO BE CONVERTED 



REGISTERS USED AS INPUT: RO 
REGISTERS MODIFIED: RO , Rl, R2 , SP 
REGISTER CONTAINING RESULT: RO 



NOTE: SINCE THE STACK POINTER 'SP' IS USED IN THE COMPRESSION 
ROUTINE 'MUCMPR', MAKE SURE TO INITIALIZE IT IN THE 
CALLING PROGRAM. 



CYCLES : 20 



WORDS: 17 



.global MUCMPR 



MUCMPR 



LDI 
ABSI 
CMPI 
LDIGT 
ADD I 



RO f Rl 
RO f RO 
1FDEH,R0 
1FDEH, RO 
33, RO 



Save sign of number 

If R0>0xlFDE, 
saturate the result 
Add bias 



FLOAT 


RO 


; Normalize : ( seg+5 ) OWXYZx . . . x 


MPYF 


0. 03125, RO 


; Adjust segment number by 2** (-5) 


LSH 


1,R0 


; (seg)WXYZx. . .x 


PUSHF 


RO 




POP 


RO 


; Treat number as integer 


LSH 


-20, RO 


? Right-justify 



LDI 

LDI 

LDILT 

ADD I 

NOT 

RETS 



0,R2 

R1,R1 

80H,R2 

R2,R0 

RO 



If number is negative, 

set sign bit 
RO = compressed number 
Reverse all bits for transmission 
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Example 1 1-27. \i-Law Expansion 
* 

* TITLE U-LAW EXPANSION 
* 

* 

* SUBROUTINE MUXPND 
* 

* 

* ARGUMENT ASSIGNMENTS : 
* 

* ARGUMENT | FUNCTION 

* -| 

* RO | NUMBER TO BE CONVERTED 
★ 

* REGISTERS USED AS INPUT; RO 

* REGISTERS MODIFIED: RO, Rl, R2 , SP 

* REGISTER CONTAINING RESULT: RO 
* 

* CYCLES: 20 (WORST CASE ) WORDS : 14 
* 



.global MUXPND 



MUXPND 



NOT 


RO ,R0 


r 


Complement bits 


LDI 


RO f Rl 






AND 


0FH,R1 


? 


Isolate quantization bin 


LSH 


l,Rl 






ADD I 


33, Rl 


f 


Add bias to introduce Ixxxxl 


LDI 


R0,R2 


f 


Store for sign bit 


LSH 


-4,R0 






AND 


7,R0 


i 


Isolate segment code 


LSH3 


R0,R1,R0 


r 


Shift and put result in RO 


SUBI 


33, RO 


t 


Subtract bias 


TSTB 


80H,R2 


t 


Test sign bit 


RETSZ 








NEGI 


RO 


i 


Negate if a negative number 


RETS 
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Example 11-28. A-Law Compression 



TITLE A-LAW COMPRESSION 
SUBROUTINE ACMPR 

ARGUMENT ASSIGNMENTS s 
ARGUMENT | FUNCTION 



RO 



| NUMBER TO BE CONVERTED 



REGISTERS USED AS INPUT: RO 
REGISTERS MODIFIED: RO, Rl, R2 , SP 
REGISTER CONTAINING RESULT: RO 



NOTE: SINCE THE STACK POINTER 'SP' IS USED IN THE COMPRESSION 
ROUTINE 'ACMPR', MAKE SURE TO INITIALIZE IT IN THE 
CALLING PROGRAM. 



* CYCLES: 22 WORDS: 19 
* 



.global ACMPR 



ACMPR 



LDI 

ABSI 

CMPI 

BLED 

CMPI 

LDIGT 

LSH 



R0,R1 
R0,R0 
1FH,R0 
END 

0FFFH,R0 
OFFFH/RO 
-1,R0 



Save sign of number 

If R0<0x20, 

do linear coding 
If R0>0xFFF, 

saturate the result 
Eliminate rightmost bit 



FLOAT 

MPYF 

LSH 

PUSHF 

POP 

LSH 



RO 

0. 125.R0 

1, R0 
RO 
RO 

-20, RO 



Normalize : ( seg+3 ) OWXYZx . . . x 
Adjust segment number by 2** (-3) 
(seg)WXYZx. . .x 

Treat number as integer 
Right-justify 



END 



LDI 
LDI 
LDILT 
ADD I 
XOR 

RETS 



0,R2 

R1,R1 

80H,R2 

R2,R0 

0D5H,R0 



If number is negative, 

set sign bit 
RO = compressed number 
Invert even bits 

for transmission 
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Example 11-29. A-Law Expansion 
* 

* TITLE A-LAW EXPANSION 
* 

* 
* 

* SUBROUTINE AXPND 
* 

* ARGUMENT ASSIGNMENTS: 

* ARGUMENT | FUNCTION 

* h 

* RO | NUMBER TO BE CONVERTED 

* REGISTERS USED AS INPUT: RO 

* REGISTERS MODIFIED: RO, Rl, R2 , SP 

* REGISTER CONTAINING RESULT: RO 
* 

* 

* CYCLES: 25 (WORST CASE) WORDS: 16 
* 

.global AXPND 

★ 



XOR 


D5H,R0 




Invert even bits 


LDI 


RO f Rl 






AND 


0FH,R1 




Isolate quantization bin 


LSH 


1,R1 






LDI 


RO ,R2 




Store for bit sign 


LSH 


-4,R0 






AND 


7,R0 




Isolate segment code 


BZ 


SKIP1 






SUBI 


1,R0 






ADD I 


32, Rl 


? 


Create lxxxxl 


ADD I 


1,R1 


} 


OR Oxxxxl 


LSH3 


RO f Rl , RO 


} 


Shift and put result in RO 


TSTB 


80H,R2 


r 


Test sign bit 


RETSZ 








NEGI 


RO 


} 


Negate if a negative number 


RETS 
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11.4.2 FIR, NR 9 and Adaptive Filters 

Digital filters are a common requirement for digital signal processing systems. 
There are two types of digital filters: finite impulse response (FIR) and infinite 
impulse response (IIR). Each of these types can have either fixed or adaptable 
coefficients. This section presents the fixed-coefficient filters first, followed by 
the adaptive filters. 



11.4.2.1 FIR Filters 



If the FIRfilter has an impulse response h [0], h [1] h [N -1], and x[n] repre- 
sents the input of the filter at time n, the output y [n] at time n is given by this 
equation: 

y [n] = h [0] x [n] + h [1] x [n -1] + ...+ h [N -1] x [n - (N -1)] 

Two features of the TMS320C3x that facilitate the implementation of the FIR 
filters are parallel multiply/add operations and circular addressing. The former 
permits the performance of a multiplication and an addition in a single machine 
cycle, while the latter makes a finite buffer of length N sufficient for the data x. 

Figure 11-1 shows the arrangement of the memory locations necessary to im- 
plement circular addressing, while Example 11-30 presents the TMS320C3x 
assembly code for an FIR filter. 



Figure 11-1. Data Memory Organization for an FIR Filter 



Low 
Address 



High 
Address 



Impulse 
Response 




Initial 
Input Samples 


Final 
Input Samples 


h(N-1) 


Oldest Input 


x[n-(N-1)] 




x(n) 


h(N-2) 


x[n-(N-2)] 




x[n-(N-1)] 


• 




• 


• 


0 






• 


• 




• 


• 


ho) 




x(n-1) 




x(n-2) 


h(0) 


Newest Input 


x(n) 




x(n-1) 



Queue 



To set up circular addressing, initialize the block-size register BK to block 
length N. Also, the locations for signal x should start from a memory location 
whose address is a multiple of the smallest power of 2 that is greater than N. 
For instance, if N = 24, the first address for x should be a multiple of 32 (the 
lowest five bits of the beginning address should be 0). See Section 5.3 on page 
5-24 for more information. 
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In Example 11-30, the pointer to the input sequence x is incremented and is 
assumed to be moving from an older input to a newer input. At the end of the 
subroutine, AR1 will be pointing to the position for the next input sample. 



Example 11-30. FIR Filter 



TITLE FIR FILTER 



SUBROUTINE FIR 



EQUATION: y(n) = h(0) * x(n) + h(l) * x(n-l) + 
... + h(N-l) * x(n-(N-l)) 

TYPICAL CALLING SEQUENCE: 



LOAD ARO 

LOAD AR1 

LOAD RC 

LOAD BK 

CALL FIR 



ARGUMENT ASSIGNMENTS : 
ARGUMENT | FUNCTION 



ARO 
AR1 
RC 
BK 



ADDRESS OF h(N-l) 
ADDRESS OF x(n-(N-l)) 
LENGTH OF FILTER - 2 (N-2) 
LENGTH OF FILTER (N) 



REGISTERS USED AS INPUT: ARO, ARl, RC f BK 
REGISTERS MODIFIED: R0, R2 , ARO, ARl, RC 
REGISTER CONTAINING RESULT: R0 



* CYCLES: 11 + (N-l) WORDS: 6 



•global FIR 

9 

FIR MPYF3 *AR0++(1) , *ARl++ ( 1 ) % ,R0 
LDF 0.0, R2 



FILTER (1 <= i < N) 



Initialize R0: 

h(N-l) * x(n-(N-i)) -> R0 
Initialize R2 



II 



RPTS RC 

MPYF3 *AR0++ ( 1 ) , *AR1++ ( 1 ) % , R0 
ADDF3 R0,R2,R2 



Set up the repeat cycle 
h ( N-l-i } *x ( n- ( N- 1-i ) ) ->R0 
Multiply and add operation 
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ADDF R0,R2,R0 



Add last product 



* RETURN SEQUENCE 



RETS 



Return 



end 



• end 



11.4.2.2 IIR Filters 



The transfer function of the IIR filters has both poles and Os. Its output depends 
on both the input and the past output. As a rule, the filters need less computa- 
tion than an FIR with similar frequency response, but the filters have the draw- 

uaor\ ui uon iy oci loiuvo isj owoiiiuigm ii v^uai iu4.«ui/i i. iviw«ji whui i, u iw ■■■ 1 nuui mi w 

implemented as a cascade of second-order sections, called biquads. 
Example 11-31 and Example 11 -32 show the implementation for one biquad 
and for any number of biquads, respectively. 

This is the equation for a single biquad: 

y[n] = a1 y [n - 1] + a2 y [n - 2] + bO x [n ] + b1 x [n -1] + b2 x [n -2] 

However, the following two equations are more convenient and have smaller 
storage requirements: 

d [n] = a2 d [n - 2] + a1 d [n -1] + x [n] 
y [n] = b2 d [n - 2] + b1 d [n - 1] + bO d [n] 

Figure 11-2 shows the memory organization for this two-equation approach, 
and Example 11-31 is an implementation of a single biquad on the 
TMS320C3X. 



Figure 11-2. Data Memory Organization for a Single Biquad 



Low 
Address 



High 
Address 



Filter 
Coefficients 



a2 



b2 



a1 



b1 



bO 



Newest Delay 
Oldest Delay 



Newest Delay 
Node Values 


Newest Delay 
Node Values 


d(n) 




d(n-1) 


d(n-1) 




d(n-2) 


d(n-2) 




d(n) 



Circular Queue 



As in the case of FIR filters, the address for the start of the values d must be 
a multiple of 4; that is, the last two bits of the beginning address must be 0. The 
block-size register BK must be initialized to 3. 
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Example 11-31. IIR Filter (One Biquad) 



TITLE IIR FILTER 

SUBROUTINE IIR 1 

IIR1 — IIR FILTER (ONE BIQUAD) 



EQUATIONS: d(n) « a2 * d(n-2) + al * d(n-l) + x(n) 
y(n) - b2 * d(n-2) + bl * d(n-l) + bO * d(n) 

OR y(n) « al*y(n-l) + a2*y(n-2) + bO*x(n) 
+ bl*x(n-l) + b2*x(n-2) 

TYPICAL CALLING SEQUENCE : 

load R2 

load ARO 

load AR1 

load BK 

CALL IIR1 



ARGUMENT ASSIGNMENTS : 
ARGUMENT | FUNCTION 



R2 
ARO 
AR1 
BK 



INPUT SAMPLE X(N) 

ADDRESS OF FILTER COEFFICIENTS (A2) 
ADDRESS OF DELAY MODE VALUES (D(N-2)) 
BK = 3 



REGISTERS USED AS INPUT: R2, ARO, AR1 , BK 
REGISTERS MODIFIED: RO, Rl, R2, ARO, ARl 
REGISTER CONTAINING RESULT: RO 



CYCLES: 11 



WORDS: 8 



FILTER 
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.global IIR1 



IIRl MPYF3 *AR0,*AR1,R0 



II 



MPYF3 *++AR0 ( 1 ) , *AR1 — (1) % ,R1 



MPYF3 *++AR0 ( 1 ) , *AR1 , RO 
ADDF3 R0,R2,R2 



a2 * d(n-2) -> RO 

b2 * d(n-2) -> Rl 

al * d(n-l) -> RO 
a2*d(n-2)+x(n) -> R2 



MPYF3 *++AR0 ( 1 ) , *AR1 — (1)%,R0 ; 
ADDF3 R0,R2,R2 ; 



bl * d(n-l) -> RO 
al*d(n-l)+a2*d(n-2)+x(n) -> R2 



MPYF3 *++AR0 ( 1 ) , R2 , R2 
STF R2,*AR1++(1)% 



ADDF R0,R2 
ADDF R1,R2,R0 



RETURN SEQUENCE 
RETS 



bO * d(n) -> R2 



Store d ( n ) and point to d(n-l) 

bl*d(n-l)+bO*d(n) -> R2 
b2*d(n-2)+bl*d(n-l) 
+bO*d(n) -> RO 



Return 



end 



.end 



In the more general case, the IIR filter contains N>1 biquads. The equations 
for its implementation are given by the following pseudo-C language code: 

y[0,n] = x[n] 

for (i = 0; i < N; i ++){ 

d [i,n] = a2 [i] d [i, n - 2] + a1 [i] d [i,n -1] + y [i - 1 ,n] 
y [i,n] = b2 [i] d [i - 2] + b1 [i] d [i,n - 1] + bO [i] d [i,n] 

} 

y[n] = y[N -1,n] 

Figure 11-3 shows the corresponding memory organization, while 
Example 11-32 shows the TMS320C3x assembly-language code. 
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Figure 11-3. Data Memory Organization for N Biquads 



Low 
Address 



High 
Address 



Filter 
Coefficients 



a2(0) 



b2(0) 



a1(0) 



b1(0) 



b0(0) 



a2(N -1) 



b2(N -1) 



a1(N -1) 



b1(N -1) 



bO(N -1) 



Newest Delay 



Oldest Delay 



Initial Delay 
Node Values 



d(0, n) 



d(0,n-1) 



d(0, n-2) 



Empty 



d(N -1,n) 



d(N -1,n-1) 



d(N -1,n-2) 



Empty 



Final Delay 
Node Values 



d(0,n-1) 



d(0, n-2) 



d(0, n) 



Empty 



d(N -1,n-1) 



d(N -1,n-2) 



d(N -1, n) 



Empty 



Circular Queue 



Circular Queue 



You should initialize the block register BK to 3; the beginning of each set of d 
values (that is, d [i,n ], i = 0...N - 1) should be at an address that is a multiple 
of 4 (where the last two bits are 0). 
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Example 11-32. IIR Filters (N > 1 Biquads) 

* 

* TITLE IIR FILTERS (N > 1 BIQUADS) 

* SUBROUTINE IIR2 

* EQUATIONS: y(0,n) = x(n) 

* FOR (i - 0; i < N; i++) 

{ 

* d(i,n) = a2(i) * d(i,n-2) + al(i) * d(i,n-l) * y(i-l,n) 

* y(i/n) - b2(i) * d(i,n-2) + bl(i) * d(i,n-l) * b0(i) * d(i,n) 

* TYPICAL CALLING SEQUENCE: 
} 

* y(n) = y(N-l,n) 

* TYPICAL CALLING SEQUENCE: 





load 


R2 


* 


load 


ARO 




load 


AR1 


* 


load 


IRO 


★ 


load 


IR1 




load 


BK 


* 


load 


RC 


* 


CALL 


IIR2 



* 



ARGUMENT ASSIGNMENT: 



ARGUMENT 



FUNCTION 



R2 

ARO 

AR1 

BK 

IRO 

IR1 

RC 



INPUT SAMPLE x(n) 

ADDRESS OF FILTER COEFFICIENTS (a2(0)) 
ADDRESS OF DELAY NODE VALUES (d(0,n-2)) 
BK - 3 
IRO = 4 
IR1 = 4*N-4 

NUMBER OF BIQUADS (N) -2 



* REGISTERS USED AS INPUT; R2, ARO, ARl, IRO, IR1, BK, RC 

* REGISTERS MODIFIED; R0, Rl, R2, ARO , ARl , RC 

* REGISTERS CONTAINING RESULT: R0 
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* CYCLES: 17 + 6N WORDS; 17 
* 

* 
* 

.global IIR2 



IIR2 


MPYF3 


*AR0, *AR1 , RO 












1 


a2(0) * d(0,n-2) -> RO 




MPYF3 


*AR0++fll. *ARl M \ % Rl 






* 






§ 


b2 ( 0 \ * dfO n— 2 \ — > Rl 


* 


MPYF3 


*++AR0(l) ,*ARl,R0 


i 


al(0) * D(0,n-1) -> RO 


II 

* 


ADDF 


RO, R2, R2 


r 


First sum term of d(0,n) 




MPYF3 


*++AR0 ( 1 ) , *AR1 — (1)%,R0 


9 


bl(0) * d(0,n-l) -> RO 


II 


ADDF3 


RO, R2, R2 


r 


Second sum term of d(0,n) 




MPYF3 


*++AR0(l) ,R2,R2 


t 


b0(0) * d(0,n) -> R2 


1 1 
1 1 
* 


Oil 


JC\Z f w rl2\.x y x ) « 




* 






i 


Store d(0,n); 








} 


point to 








9 


d(0,n-2) 


* 


RPTB 


LOOP 


9 


Loop for 1 <= i < n 


II 

* 


MPYF3 


*++AR0 ( 1 ) , *++ARl ( IRO ) , RO 


9 


a2(i) * d(i,n-2) -> RO 


ADDF 3 


R0,R2,R2 


9 


First sum term of y(i-l,n) 




MPYF3 


*++AR0 ( 1 ) , *AR1 — ( 1 ) %R1 


9 


b2(i) * D(i,n-2) -> Rl 


II 


ADDF 3 


R1,R2,R2 


9 


Second sum term 


* 






9 


of y(i-l,n) 




MPYF3 


*++AR0(l) , *AR1,R0 


9 


al(i) * d(i,n-l) -> RO 


II 


ADDF 3 


R0,R2,R2 


9 


First sum of d(i,n) 




MPYF3 


*++AR0 ( 1 ) , *AR1 — ( 1 ) % , RO 


9 


bl(i) * d(i,n-l) -> RO 


1 1 
1 1 
* 


ADDF J 


RO ,R2 ,R2 


9 


Second sum term of d(i,n) 




STF 


R2, *AR1 (1)% 












9 


Store d(i,n); 








9 


point to d(i,n-2) 


LOOP 


MPYF3 


*++AR0(l), R2,R2 






* 
* 






9 


bO(i) * d(i,n) -> R2 
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* FINAL SUMMATION 



ADDF R0,R2 
ADDF3 Rl,R2,R0 



NOP *AR1— (IR1) 
NOP *AR1 — (1)% 



* RETURN SEQUENCE 
★ 



RETS 



* end 

* 



First sum term of y(n-l,n) 
Second sum term 
of y(n-l,n) 

Return to first biguad 
Point to d(0,n-l) 



Return 
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11.4.2.3 Adaptive Filters (LMS Algorithm) 

In some applications in digital signal processing, you must adapt a filter over 
time to keep track of changing conditions. The book Theory and Design of 
Adaptive Filters by Treichler, Johnson, and Larimore (Wiley-lnterscience, 
1 987) presents the theory of adaptive filters. Although in theory, both FIR and 
IIR structures can be used as adaptive filters, the stability problems and the 
local optimum points that the IIR filters exhibit make them less attractive for 
such an application. Hence, until further research makes IIR filters a better 
choice, only the FIR filters are used in adaptive algorithms of practical applica- 
tions. 

In an adaptive FIR filter, the filtering equation takes this form: 

y [n] = h [n,0] x [n] + h [n,1] x [n - 1] + ... + h [n,N -1]x[n-(N- 1)] 

The filter coefficients are time-dependent. In a least-mean-squares (LMS) al- 
gorithm, the coefficients are updated by an equation in this form: 

h[n + 1,i] = h[n,i]+ px[n-i], i = 0,1,...,N-1 

p is a constant for the computation. You can interleave the updating of the filter 
coefficients with the computation of the filter output so that it takes three cycles 
per filter tap to do both. The updated coefficients are written over the old filter 
coefficients. Example 1 1-33 shows the implementation of an adaptive FIR fil- 
ter on the TMS320C3x. The memory organization and the positioning of the 
data in memory should follow the same rules that apply to the FIR filter de- 
scribed in subsection 11.4.2.1 on page 11-58. 
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Example 11-33. 

* 

it 
* 

* 
* 
* 

* 

* 

* 
* 

* 
* 
* 

* 
* 
it 
it 

* 

* 
* 
* 

* 

* 



Adaptive FIR Filter (LMS Algorithm) 

TITLE ADAPTIVE FIR FILTER (LMS ALGORITHM) 

SUBROUTINE LMS 

LMS == LMS ADAPTIVE FILTER 



EQUATIONS: y(n) = h(n,0)*x(n) + h(n, 1 ) *x(n-l ) + ... 
+ h(n,N-l)*x(n-(N-l) ) 
FOR (i = 0; i < N; i++) 

h(n+l,i) = h(n,i) + tmuerr * x(n-i) 



TYPICAL CALLING SEQUENCE: 



load R4 

load ARO 

load ARl 

load RC 

load BK 

CALL LMS 



ARGUMENT ASSIGNMENTS : 



ARGUMENT 


| FUNCTION 
-H 




R4 


| SCALE FACTOR (2 * mu 


* err) 


ARO 


| ADDRESS OF h(n,N-l) 




ARl 


j ADDRESS OF x(n-(N-l)) 




RC 


| LENGTH OF FILTER - 2 


(N-2) 


BK 


| LENGTH OF FILTER (N) 
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REGISTERS USED AS INPUT : R4, ARO, ARl, RC, BK 
REGISTERS MODIFIED: RO, Rl, R2 , ARO, ARl, RC 
REGISTER CONTAINING RESULT: RO 

PROGRAM SIZE: 10 words 

EXECUTION CYCLES: 14 + 3(N-1) 



SETUP (i * 0) 



•global LMS 

LMS MPYF3 *AR0, *ARl, RO 
LDF 0.0 ,R2 

MPYF3 *AR1++(1)%, R4, Rl 
ADDF3 *AR0++(1), Rl, Rl 



Initialize RO: 

h(n,N-l) * x(n-(N-l)) -> RO 
Initialize R2 

Initialize Rl: 

x(n-(N-l)) * tmuerr -> Rl 

h(n,N-l) + x(n-(N-l)) * 
tmuerr -> Rl 



FILTER AND UPDATE (1 <» I < N) 



II 

LOOP 



RPTB LOOP 

MPYF3 * ARO — ( 1 ) , * ARl , RO 
ADDF3 R0,R2,R2 



MPYF3 *AR1++ ( 1 ) % , R4 , Rl 
STF R1,*AR0++(1) 

ADDF3 *AR0++(1), Rl, Rl 



ADDF3 R0,R2,R0 
STF Rl,*-AR0(1) 



Set up the repeat block 

Filter: 
h(n,N-l-i) 

* x(n-(N-l-i)) -> RO 
Multiply and add operation 

UPDATE : 

x(n,N-(N-l-i) ) * tmuerr -> Rl 
Rl -> h(n+l,N-l-(i-l) ) 



h(n,N-l-i) + x(n-(N-l-i) ) 
* tmuerr -> Rl 

Add last product 
h(n,0) + x(n) 

* tmuerr -> h(n+l,0) 



* RETURN SEQUENCE 
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RETS ; Return 

* 

* end 
.end 



11.4.3 Matrix-Vector Multiplication 

In matrix-vector multiplication, a K x N matrix of elements m(i,j) having K rows 
and N columns is multiplied by an N x 1 vector to produce a K x 1 result. The 
multiplier vector has elements v(j), and the product vector has elements p(i). 
Each one of the product-vector elements is computed by the following expres- 
sion: 

p (i ) = m (i,0) v (0) + m (i,1) v (1) + ... + m (i,N - 1) v (N - 1) i = 0,1 K- 1 

This is essentially a dot product, and the matrix-vector multiplication contains, 
as a special case, the dot product presented in Example 11-2 on page 11-7. 
In pseudo-C format, the computation of the matrix multiplication is expressed 
by 

for (i = 0; i < K; i + +) { 
p (i) = 0 

for (j = 0; j < N; j + +) 

p(i) = p(i) + m(i,j)*v(j) 

} 

Figure 1 1-4 shows the data memory organization for matrix-vector multiplica- 
tion, and Example 11-34 shows the TMS320C3x assembly code that imple- 
ments it. Note that in Example 11-34, K (number of rows) should be greater 
than 0, and N (number of columns) should be greater than 1 . 
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Figure 11-4. Data Memory Organization for Matrix-Vector Multiplication 



Low 
Address 



High 
Address 



Matrix Storage 



m(0, 0) 



m(0, 1) 



m(0, N-1) 

m(1.0) 
mfl.1) 



Input 
Vector Storage 



v(0) 



Result 
Vector Storage 



P(0) 



I v(N-D "1 I P(K-1) ~| 
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Example 11-34. Matrix Times a Vector Multiplication 



* 






* 
* 


TITLE 


MATRIX TIMES A VECTOR MULTIPLICATION 


* 
* 
* 


SUBROUTINE MAT 


it 
* 


MAT — MATRIX TIMES A VECTOR OPERATION 


* 
* 


TYPICAL CALLING SEQUENCE:* 


* 


load 


ARO 


* 


load 


ARl 


* 


load 


AR2 


* 


load 


AR3 


* 


load 


Rl 


* 


CALL 


MAT 


* 






* 







* ARGUMENT ASSIGNMENTS: 



* ARGUMENT | FUNCTION 



ARO 


| ADDRESS OF M(0,0) 






ARl 


j ADDRESS OF V(0) 






AR2 


| ADDRESS OF P(0) 






AR3 


| NUMBER OF ROWS - 1 


(K- 


1) 


Rl 


j NUMBER OF COLUMNS - 


2 


(N-2) 



* REGISTERS USED AS INPUT: ARO, ARl, AR2 , AR3, Rl 

* REGISTERS MODIFIED: RO, R2, ARO, ARl, AR2, AR3, IRO, 

* RC, RSA, REA 
* 

* PROGRAM SIZE: 11 
* 

* EXECUTION CYCLES: 6+10*K+K* (N-l) 
* 



.global MAT 
* SETUP 

; Number of columns-2 -> IRO 
; IRO - N 



MAT LDI R1,IR0 
ADDI 2 , IRO 
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* FOR (i ■ 0; i < K; i++) LOOP OVER THE ROWS 



ROWS LDF 0.0, R2 ; Initialize R2 

MPYF3 *AR0++ ( 1 ) , *AR1++ ( 1 ) , R0 
* j m(i,0) * v(0) -> R0 



* FOR (j = 1; j < N; j++) DO DOT PRODUCT OVER COLUMNS 



RPTS Rl 



Multiply a row by a column 



MPYF3 *AR0++(1) , *AR1++(1) ,R0 ; m(i,j) * v(j) -> R0 

|| ADDF3 R0,R2,R2 ; m(i,j-l) * v(j-l) + R2 -> R2 

* 

DBD AR3,ROWS j Counts the no. of rows left 



ADDF R0,R2 ; 

STF R2,*AR2++(1) ? 

NOP * — ARl ( IRO ) 

! I 1 DELAYED BRANCH HAPPENS HERE 1 I ! 



Last accumulate 
Result -> p(i) 

Set ARl to point to v(0) 



* RETURN SEQUENCE 



RETS 



Return 



* end 
* 

• end 



11.4.4 Fast Fourier Transforms (FFT) 

Fourier transforms are an important tool often used in digital signal processing 
systems. The purpose of the transform is to convert information from the time 
domain to the frequency domain. The inverse Fourier transform converts infor- 
mation back to the time domain from the frequency domain. Implementation 
of Fourier transforms that are computationally efficient are known as fast Four- 
ier transforms (FFTs). The theory of FFTs can be found in books such as DFT/ 
FFT and Convolution Algorithms by C.S. Burrus and T.W. Parks (John Wiley, 
1 985) and Digital Signal Processing Applications with the TMS320 Family by 
Texas Instruments (literature number SPRA012A). 
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Fast Fourier transform is a label for a collection of algorithms that implement 
efficient conversion from time to frequency domain. There are several types 
of FFTs: 

□ Radix-2 or radix-4 algorithms (depending on the size of the FFT butterfly) 

□ Decimation in time or frequency (DIT or DIF) 

□ Complex or real FFTs 

□ FFTs of different lengths, etc. 

Certain TMS320C3x features that increase efficient implementation of numer- 
ically intensive algorithms are particularly well-suited for FFTs. The high speed 
of the device (33-ns cycle time) makes implementation of real-time algorithms 
easier, while floating-point capability eliminates the problems associated with 
dynamic range. The powerful indirect-addressing indexing scheme facilitates 
the access of FFT butterfly legs with different spans. The repeat block implem- 
ented by the RPTB instruction reduces the looping overhead in algorithms 
neavily dependent on loops (such as the FFTs). This construct provides the 
efficiency of in-line coding in loop form. The FFT will reverse the bit order of 
the output; therefore, the output must be reordered. This reordering does not 
require extra cycles, because the device has a special mode of indirect ad- 
dressing (bit-reversed addressing) for accessing the FFT output in the original 
order. 

The examples in this subsection were based on programs contained in the 
Burrus and Parks book and in the paper Real-Valued Fast Fourier Transform 
Algorithms by H.V Sorensen, et al (IEEE Transform on ASSP, June 1987). 

Example 11-35 and Example 11-36 show the implementation of a complex 
radix-2, DIF FFT on the TMS320C3x. Example 11-35 contains the generic 
code of the FFT, which can be used with a number of any length. However, for 
the complete implementation of an FFT, you need a table of twiddle factors 
(sines/cosines); the length of the table depends on the size of the transform. 
To retain the generic form of Example 1 1 -35, the table with the twiddle factors 
(containing 1-1/4 complete cycles of a sine) is presented separately in 
Example 1 1 -36 for the case of a 64-point FFT. A full cycle of a sine should have 
a number of points equal to the FFT size. Example 1 1-36 uses two variables: 
N, which is the FFT length, and M, which is the logorithm of N to a base equal 
to the radix. In other words, Mis the number of stages of the FFT. For example, 
in a 64-point FFT, M = 6 when using a radix-2 algorithm, and M = 3 when using 
a radix-4 algorithm. If the table with the twiddle factors and the FFT code are 
kept in separate files, they should be connected at link time. 
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Example 11-35. Complex, Radix-2, DIFFFT 
* 

* TITLE COMPLEX, RADIX-2, DIF FFT 

* GENERIC PROGRAM FOR LOOPED-CODE RADIX-2 FFT COMPUTATION IN TMS320C3X 
* 

* THE PROGRAM IS TAKEN FROM THE BURRUS AND PARKS BOOK, P. 111. 

* THE (COMPLEX) DATA RESIDE IN INTERNAL MEMORY. THE COMPUTATION 

* IS DONE IN PLACE, BUT THE RESULT IS MOVED TO ANOTHER MEMORY 

* SECTION TO DEMONSTRATE THE BIT-REVERSED ADDRESSING. 
* 

* THE TWIDDLE FACTORS ARE SUPPLIED IN A TABLE THAT IS PUT IN A .DATA 

* SECTION. THIS DATA IS INCLUDED IN A SEPARATE FILE TO PRESERVE THE 

* GENERIC NATURE OF THE PROGRAM. FOR THE SAME PURPOSE, THE SIZE OF 

* THE FFTN AND LOG2(N) ARE DEFINED IN A • GLOBL DIRECTIVE AND SPECIFIED 

* DURING LINKING. 
* 

* 





.globl FFT 




Entry point for execution 




.globl N 


> 


FFT size 




.globl M 




LOG2(N) 




.globl SINE 




Address of sine table 


INP 


.usect "IN", 1024 


9 


MrlKSUiyJi. j[ W«I»l»ll XlipUL UCllvO 


.BSS 


OUTP,1024 


t 


Memory with output data 




.text 






* INITIALIZE 






FFTSIZ 


.word N 






LOGFFT 


.word M 






SINTAB 


.word SINE 






INPUT 


• word INP 






OUTPUT 


.word OUTP 






FFT: 


LDP FFTSIZ 


t 


Command to load data page pointer 


LDI 


@FFTSIZ,IR1 






LSH 


-2,IR1 


t 


IR1 » N/4, pointer for SIN/COS table 


LDI 


0,AR6 


t 


AR6 holds the current stage number 


LDI 


8FFTSIZ,IR0 






LSH 


1,IR0 


*9 


IRO = 2*N1 (because of real/imag) 


LDI 


@FFTSIZ,R7 


r 


R7 = N2 


LDI 


1,AR7 


1 


Initialize repeat counter 






} 


of first loop 


LDI 


1,AR5 


} 


Initialize IE index (AR5 = IE) 
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OUTER LOOP 



LOOP: 



NOP 
LDI 
ADD I 
LDI 
SUBI 



*++AR6 ( 1 ) 
@ INPUT, ARO 
R7 , ARO , AR2 
AR7,RC 
1,RC 



Current FFT stage 
ARO points to X(I) 
AR2 points to X(L) 

RC should be one less than desired # 



FIRST 


LOOP 










RPTB 


BLK1 








ADDF 


*AR0,*AR2,R0 , 


■ RO - 


X(I)+X(L) 




SUBF 


*AR2++,*AR0++,R1 , 


• Rl * 


X(I)-X(L) 




ADDF 


*AR2 , *AR0 ,R2 


• R2 = 


Y(I)+Y(L) 




SUBF 


*AR2,*AR0,R3 , 


• R3 - 


Y(I)-Y(L) 




STF 


R2 , *AR0 , 


' Y(I) 


* R2 and . . . 


1 1 
l 1 


STF 


R3 , *AR2 , 


Y'L) 


= R3 


BLK1 


STF 


RO, *AR0++(IR0) , 


' X(I) 


■ RO and . . . 


II 


STF 


R1,*AR2++(IR0) t 


1 X(L) 


= Rl and ARO 



ARO , 2 + 2*n 



* IF THIS IS THE LAST STAGE, YOU ARE DONE 



CMP I 
BZD 



§LOGFFT,AR6 
END 



MAIN INNER LOOP 

LDI 2,AR1 



INLOP : 



II 
* 

II 



LDI 
ADD I 

LDI 
ADD I 
ADD I 
ADD I 
LDI 
SUBI 



LDF 

SECOND LOOP 

RPTB 
SUBF 
SUBF 

MPYF 
ADDF 

MPYF 
STF 



@SINTAB, AR4 
AR5, AR4 

ARl , ARO 
2,AR1 

§ INPUT, ARO 
R7,AR0,AR2 
AR7,RC 
1,RC 

*AR4,R6 



BLK2 

*AR2,*AR0,R2 ; 
*+AR2,*+AR0,Rl 

i 

R2,R6,R0 ; 
*+AR2,*+AR0,R3 

R1,*+AR4(IR1) ,R3 ,• 
R3,*+AR0 ; 



Init loop counter for 

inner loop 
Initialize IA index (AR4 = 
IA = IA+IE; AR4 points to 

cosine 



IA) 



Increment inner loop counter 
(X(I),Y(I)) pointer 
(X(L) ,Y(L) ) pointer 

RC should be 1 less than 

desired # 
R6 - SIN 



R2 = X(I)-X(L) 



Rl = 
RO - 

R3 = 
R3 = 
Y(I) 



Y(I)-Y(L) 
R2*SIN and. , 

Y(I)+Y(L) 
Rl*COS and 
« Y(I)+Y(L) 
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SUBF 


R0,R3,R4 , 


• R4 - Rl * COS-R2 * SIN 




MPYF 


R1,R6,R0 , 


RO « Rl * SIN and. . . 


1 1 


ADDF 


*AR2,*AR0,R3 < 


• R3 - X(I) + X(L) 




MPYF 


R2,*+AR4(IRl),R3 t 


f R3 ■ R2 * COS and. . . 


II 

* 


STF 


R3, *ARO++(IRO) 




* 






f X(I) - X(I)+X(L) and ARO - AR0+2*N1 




ADDF 


R0,R3,R5 


f R5 - R2*COS+Rl*SIN 


BLK2 


STF R5 , 


*AR2++(IR0) 


f X(L) = R2 * COS+R1 * SIN, 








} incr AR2 and . . . 


II 


STF R4 , 


*+AR2 


} Y(L) « Rl*COS-R2*SIN 




CMP I 


R7,AR1 






BNE 


INLOP ; Loop back to the inner loop 




LSH 


1,AR7 


} Increment loop counter for next time 




BRD 


LOOP 


; Next FFT stage (delayed) 




LSH 


1,AR5 


; IE = 2*IE 




LDI 


R7 , IRO 


} Nl = N2 




LSH 


-1,R7 


} N2 - N2/2 



STORE RESULT OUT USING BIT-REVERSED ADDRESSING 



END: 



LDI 

SUBI 

LDI 

LDI 

LDI 

LDI 



@FFTSIZ,RC 
1,RC 

@FFTSIZ,IRO 
2,IR1 

8 INPUT, ARO 
eOUTPUT,ARl 



RC - N 

RC should be one less than desired # 
IRO = size of FFT = N 



RPTB 


BITRV 


LDF 


*+AR0(l) ,R0 


| | LDF 


*AR0++(IR0)B,R1 


BITRV STF 


R0,*+AR1(1) 


| | STF 


R1,*AR1++(IR1) 


SELF BR 


SELF 


.end 





Branch to itself at the end 
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Example 1 1-36. Table With Twiddle Factors for a 64-Point FFT 



*TITLE TABLE WITH TWIDDLE FACTORS FOR A 64-POINT FFT 
* 

* FILE TO BE LINKED WITH THE SOURCE CODE FOR A 64-POINT, RADIX-2 FFT 



.globl SINE 
.globl N 
.globl M 



N 
M 



•set 64 
• set 6 



SINE 



COSINE 



• data 

. float 
.float 
. float 

• float 
•float 
•float 
•float 
. float 
•float 
•float 
•float 
•float 
•float 
. float 
•float 
. float 

. float 
•float 
•float 
•float 
•float 
. float 
•float 
•float 
•float 
•float 
•float 
•float 
•float 
. float 
•float 



0.000000 
0.098017 
0.195090 
0.290285 
0.382683 
0.471397 
0.555570 
0.634393 
0.707107 
0.773010 
0.831470 
0.881921 
0.923880 
0.956940 
0.980785 
0.995185 

1.000000 
0.995185 
0.980785 
0.956940 
0.923880 
0.881921 
0.831470 
0.773010 
0.707107 
0.634393 
0.555570 
0.471397 
0.382683 
0.290285 
0.195090 
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•float 
.float 
•float - 
•float - 
. float - 
•float - 
•float - 
•float 
. float — 
•float - 
•float - 
•float — 
. float - 
•float - 
•float — 
•float - 
•float - 
• float 
. float — 
•float - 
. float — 
•float - 
•float — 
•float - 
•float - 
•float - 
. float - 
•float — 
•float - 
•float - 
•float - 
•float - 
•float - 



0.098017 

0.000000 

0.098017 

0.195090 

0.290285 

0.382683 

0.471397 

-0.555570 

0.634393 

0.707107 

0.773010 

0.831470 

0.881921 

0.923880 

0.956940 

0.980785 

0.995185 

-1.000000 

0.995185 

0.980785 

0.956940 

0.923880 

0.881921 

0.831470 

0.773010 

0.707107 

0.634393 

0.555570 

0.471397 

0.382683 

0.290285 

0.195090 

0.098017 
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•float 
• float 
.float 
.float 
.float 
•float 
.float 
•float 
•float 
•float 
•float 
•float 
•float 
•float 
. float 
•float 



0.000000 
0.098017 
0.195090 
0.290285 
0.382683 
0.471397 
0.555570 
0.634393 
0.707107 
0.773010 
0.831470 
0.881921 
0.923880 
0.956940 
0.980785 
0.995185 




algorithm is relatively easy to understand. However, radix-4 implementation 
can increase execution speed by reducing the amount of arithmetic required. 
Example 11-37 shows the generic implementation of a complex, DIF FFT in 
radix-4. A companion table, such as the one in Example 11-36, should have 
a value of M equal to the logN, where the base of the logarithm is 4. 
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Example 11-37. Complex, Radix-4, DIFFFT 

* 

* TITLE COMPLEX, RADIX-4, DIF FFT 

* GENERIC PROGRAM TO PERFORM A LOOPED-CODE RADIX-4 FFT COMPUTATION 

* IN THE TMS320C3X 

* THE PROGRAM IS TAKEN FROM THE BURRUS AND PARKS BOOK, P. 117. 

* THE (COMPLEX) DATA RESIDE IN INTERNAL MEMORY, AND THE COMPUTATION 

* IS DONE IN PLACE. 
* 

* THE TWIDDLE FACTORS ARE SUPPLIED IN A TABLE THAT IS PUT IN A .DATA 

* SECTION. THIS DATA IS INCLUDED IN A SEPARATE FILE TO PRESERVE THE 

* GENERIC NATURE OF THE PROGRAM. FOR THE SAME PURPOSE, THE SIZE OF 

* THE FFT N AND LOG4(N) ARE DEFINED IN A • GLOBL DIRECTIVE AND 

* SPECIFIED DURING LINKING. 
* 

* IN ORDER TO HAVE THE FINAL RESULT IN BIT-REVERSED ORDER, THE TWO 

* MIDDLE BRANCHES OF THE RADIX-4 BUTTERFLY ARE INTERCHANGED DURING 

* STORAGE. NOTE THIS DIFFERENCE WHEN COMPARING WITH THE PROGRAM IN 

* P. 117 OF THE BURRUS AND PARKS BOOK. 
* 



.globl 


FFT ; 


Entry point for execution 


.globl 


N ; 


FFT size 


•globl 


M ; 


LOG4(N) 


.globl 


SINE ; 


Address of sine table 


.usect 


"IN", 1024 ; 


Memory with input data 


.text 







* INITIALIZE 



TEMP .word 
STORE .word 
.word 
.word 
.word 
.word 



$+2 

FFTSIZ 

N 

M 

SINE 
INP 



Beginning of temp storage area 



.BSS 


FFTSIZ, 1 


f FFT size 


• BSS 


LOGFFT , 1 


. LOG4 (FFTSIZ) 


.BSS 


SINTAB, 1 


) Sine/cosine table base 


• BSS 


INPUT, 1 


f Area with input data to process 


.BSS 


STAGE, 1 


} FFT stage # 


• BSS 


RPTCNT,1 


f Repeat counter 


• BSS 


IEINDX,! , 


IE index for sine/cosine 
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.BSS LPCNT,1 ; Second-loop count 



FFTs 

* INITIALIZE DATA LOCATIONS 

LDP TEMP ; Command to load data page counter 

LDI @TEMP,AR0 

LDI @ STORE, ARl 

LDI *AR0++,R0 ; Xfer data from one memory to the other 

STI R0,*AR1++ 

LDI *AR0++,R0 

STI R0,*AR1++ 

LDI *AR0++,R0 

STI R0,*AR1++ 

LDI *AR0,R0 

STI RO , *AR1 

LDP FFTS I Z ? Command to load data page pointer 

LDI §FFTSIZ,R0 

LDI §FFTSIZ,IRO 

LDI §FFTSIZ,IRl 

LDI 0,AR7 

STI AR7 , § STAGE ; § STAGE holds the current stage number 

LSH 1,IR0 ; IRO - 2*Nl (because of real/imag) 

LSH -2,IR1 ; IR1 - N/4, pointer for SIN/COS table 

LDI 1,AR7 

STI AR7 , 6RPTCNT ; Init repeat counter of first loop 

STI AR7,@IEINDX ; Init. IE index 

LSH -2,R0 ; JT « RO/2+2 

ADD I 2,R0 

STI R0,@JT 

SUBI 2,R0 

LSH 1,R0 ; RO - N2 

* OUTER LOOP 
LOOP: 

LDI @ INPUT, ARO ; ARO points to X(I) 

ADD I R0,AR0,AR1 ; ARl points to X(I1) 

ADD I R0,AR1,AR2 ; AR2 points to X(I2) 

ADD I R0,AR2,AR3 ; AR3 points to X(I3) 

LDI §RPTCNT,RC 

SUBI 1,RC ; RC should be one less than desired # 

* FIRST LOOP 

RPTB BLK1 

ADDF *+AR0,*+AR2,Rl 
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; Rl * Y(I)+Y(I2) 





ADDF 


*+AR3,*+ARl,R3 




* 




; R3 « Y(I1)+Y(I3) 




ADDF 


R3,R1,R6 ; R6 ■ R1+R3 




SUBF 






* 






. "D A k V/T\ V/TO\ 

* f R4 « Y(I)-Y(I^) 




STF 


Jt\0 , T"/vr\.u 


» V / T \ s P 1 XP *3 




SUBF 


P** Pi 


» Pi s P 1 _D 1 




LDF 


* AR2 , R5 


» R5 as X(I2) 


1 1 

1 1 


LDF 


*+ARl,R7 , 


} R7 ■ Y(I1) 




ADDF 


*AR3,*ARl,R3 


? R3 ■ X(I1)+X(I3) 




»nnr 


R5, *AR0,R1 , 


; Rl - X(I)+X(I2) 


1 1 
1 1 


Oil 


Rl, *+ARl 


; Y(I1) » R1-R3 




ADDF 


R3,R1,R6 , 


; R6 - R1+R3 




SUBF 


R5,*AR0,R2 


} R2 = X(I)-X(I2) 


1 1 

1 1 


STF 


R6,*AR0++(IR0) , 


; X(I) - R1+R3 




SUBF 


R3,R1 


) Rl = R1-R3 




SUBF 


*AR3,*AR1,R6 


? R6 * X(I1)-X(I3) 




QTTUT? 
DUOf 


R7,*+AR3,R3 


) -R3 - Y(I1)-Y(I3) 


1 1 
1 1 


olr 


Rl,*ARl++(IR0) 


A (11; — RI-RJ 




SUBF 


R6,R4,R5 


f R5 - R4-R6 




ADDF 


R6,R4 


» PA s PAxpfi 




STF 


R5 , *+AR2 


f Y(I2) - R4-R6 


II 


STF 


R4,*+AR3 , 


) Y(I3) - R4+R6 




SUBF 


R3,R2,R5 t 


R5 = R2-R3 




ADDF 


R3,R2 


) R2 ■ R2+R3 




STF 


R5,*AR2++(IR0) t 


• X(I2) - R2-R3 


1 1 
1 1 


STFR2,*AR3++(IR0) , 


• X(I3) - R2+R3 


* IF 


THIS 


IS THE LAST STAGE, 


YOU ARE DONE 




LDI 


@ STAGE, AR7 






ADD I 


1,AR7 






CMP I 


@LOGFFT, AR7 






BZD 


END 






STI 


AR7 , @ STAGE ; Current FFT stage 



* MAIN INNER LOOP 



LDI 1,AR7 

STI AR7,@IAl 

LDI 2,AR7 

STI AR7,@LPCNT 

LDI 2,AR6 

ADDI §LPCNT,AR6 

LDI @LPCNT,AR0 

LDI §IA1,AR7 

ADDI §IEINDX,AR7 

ADDI @ INPUT, ARO 

STI AR7,eiAl 
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Init loop counter for inner loop 
INLOPs 

Increment inner loop counter 



IAl * IA1+IE 
(X(I),Y(I) ) pointer 
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ADD I 


R0,AR0,AR1 ,* 


(X(I1),Y(I1)) pointer 


STI 


AR6,@LPCNT 






ADDI 


R0,AR1,AR2 ; 


(X( 12 ) , Y( 12 ) ) pointer 


ADD I 


R0,AR2,AR3 ; 


(X(I3) ,Y(I3) ) pointer 


LDI 


§RPTCNT,RC 






SUBI 


1,RC 




RC should be one less than 


CMP I 


@JT, AR6 




If LPCNT * JT, go to 


BZD 


SPCL 




special butterfly 


T TNT 


8IA1,AR7 






t fit 


§IAl,AR4 






ADDI 


@SINTAB,AR4 ; 


Create cosine index AR4 


SUBI 


1,AR4 ; 


Adjust sine table pointer 


ADDI 


AR4 , AR7 , AR5 






SUBI 


1 , AR5 ; 


IA2 - IAl+IAl-1 


ADDI 


AR7,AR5,AR6 






SUBI 


1 , AR6 ; 


IA3 - IA2+IA1-1 


SECOND LOOP 






RPTB 


BLK2 






ADDF 


*+AR2, *+AR0,R3 








r 


DO — V / T \ IV / T O \ 
KJ — X (1)TX (1^ ) 


ADDF 


*+AR3,*+ARl,R5 








? 


PR sb V 1 T 1 \ 4-V / T "\ \ 


ADDF 


R5,R3,R6 


Pfi = R74.Rt; 
*%w TAJ 


SUBF 


*+AR2,*+AR0,R4 












"D A — V / T \ V/TO\ 


SUBF 


R5,R3 




Rj == RJ— KD 


ADDF 


*AR2 t *AR0,R1 , 




Bl = V / T \ XV / T O \ 

Ki 38 &.[ ± ) -rA( ±Z ) 


ADDF 


*AR3,*ARl,R5 , 






MPYF 


R3, *+AR5(IRl) ,R6 


R6 




STF 


R6,*+AR0 t 




Yfll » R3+R5 


ADDF 


R5,R1,R7 




R7 = R1+R5 


SUBF 


*AR2 , *AR0 , R2 , 




R2 ■ X(I)-X(I2) 


SUBF 


R5,R1 , 




Rl =s Rl— R5 


MPYF 


R1,*AR5,R7 




R7 = R1*SI2 


STF R7, 


*AR0++(IR0) t 




* R1+R5 


SUBF 


R7 , R6 




R6 =s R3*C02— Rl*ST2 


SUBF 


*+AR3,*+ARl,R5 












— 1 \ ll)'*I \1J ) 


MPYF 


Rl,*+AR5(IRl) ,R7 , 




R7 = Rl*C02 


STF 


R6,*+AR1 t 




Y(I1) = R3*C02-Rl*SI2 


MPYF 


R3,*AR5,R6 t 




R6 35 R3*SI2 


ADDF 


R7,R6 




R6 » Rl*C02+R3*SI2 


ADDF 


R5,R2,R1 < 




Rl = R2+R5 


SUBF 


R5 ,R2 




R2 = R2-R5 


SUBF 


*AR3,*ARl,R5 , 




R5 - X(I1)-X(I3) 


SUBF 


R5,R4,R3 




R3 = R4-R5 


ADDF 


R5 ,R4 




R4 = R4+R5 


MPYF 


R3,*+AR4(IRl),R6 




R6 - R3*C01 


STF R6 , 


*AR1++(IR0) 




X(I1) - R1*C02+R3*SI2 
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II 



MPYF R1,*AR4,R7 

SUBF R7,R6 

MPYF Rl , *+AR4 ( IRl ) , R6 

STF R6 , *+AR2 

MPYF R3,*AR4,R7 

ADDF R7 , R6 

MPYF R4 , *+AR6 ( IRl ) , R6 

STFR6,*AR2++(IR0) 

MPYF R2,*AR6,R7 

SUBF R7,R6 

MPYF R2 , *+AR6 ( IRl ) , R6 

STF R6, *+AR3 

MPYF R4,*AR6,R7 

ADDF R7,R6 



R7 - R1*SI1 

R6 - R3*C01-R1*SI1 

R6 - Rl*C01 

Y(I2) = R3*C01-Rl*SIl 

R7 - R3*SI1 

R6 = Rl*C 01+R3*SI1 

R6 « R4*C03 

X(I2) = R1*C01+R3*SI1 

R7 - R2*SI3 

R6 - R4*C03-R2*SI3 

R6 - R2*C03 

Y(I3) - R4*C03-R2*SI3 

R7 = R4*SI3 

R6 - R2*C03+R4*SI3 



BLK2 STF R6,*AR3++(IR0) 



x(i3) = R2*C03+R4*SI3 



CMPI @LPCNT,RO 

BP INLOP ; Loop back to the inner loop 

BR CONT 



* SPECIAL BUTTERFLY FOR W « J 



SPCL LDI IRl , AR4 

LSH -1 , AR4 ; Point to SIN(45) 

ADDI @SINTAB f AR4 ? Create cosine index AR4 * C021 



RPTB BLK3 

ADDF *AR2 / *AR0,R1 

SUBF *AR2,*AR0,R2 

ADDF *+AR2,*+AR0 / R3 

SUBF *+AR2,*+AR0,R4 

ADDF *AR3 r *ARl,R5 

SUBF R1,R5,R6 

ADDF R5,R1 

ADDF *+AR3,*+ARl,R5 

SUBF R5,R3,R7 

ADDF R5 ,R3 

STF R3,*+AR0 

STF Rl, *AR0++(IR0) 

SUBF *AR3 , *AR1 ,R1 

SUBF *+AR3 f *+ARl,R3 

STF R6,*+AR1 



/ 


Rl = 


X(I)+X(I2) 


/ 


R2 = 


X(I)-X(I2) 


/ 


R3 - 


Y(I)+Y(I2) 


1 


R4 = 


Y(I)-Y(I2) 


r 


R5 = 


X(I1)+X(I3) 


f 


R6 = 


R5-R1 


t 


Rl = 


R1+R5 


} 


R5 = 


Y(I1)+Y(I3) 


i 


R7 = 


R3-R5 


9 


R3 = 


R3+R5 


f 


Y(I) 


= R3+R5 


• 

r 


X(I) 


- R1+R5 


t 


Rl = 


X(I1)-X(I3) 



; R3 - Y(I1)-Y(I3) 
} Y(I1) = R5-R1 
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II 


STF 


R7, *AR1++(IR0) , 


\ X(I1) » R3-R5 




ADDF 


R3,R2,R5 , 


) R5 » R2+R3 




SUBF 


R2,R3,R2 t 


■ R2 - -R2+R3 




SUBF 


R1,R4,R3 t 


) R3 - R4-R1 




ADDF 


Rl ,R4 < 


\ R4 - R4+R1 




SUBF 


R5,R3,R1 , 


• Rl = R3-R5 




MPYF 


*AR4 ,R1 , 


} Rl - R1*C021 




ADDF 


R5,R3 , 


) R3 « R3+R5 




MPYF 


*AR4 1 R3 t 


\ R3 * R3*C021 


II 


STF 


Rl,*+AR2 , 


• Y(I2) » (R3-R5)*C021 




SUBF 


R4,R2,Rl , 


• Rl « R2-R4 




MPYF 


*AR4,Rl , 


• Rl - R1*C021 


II 


STF 


R3,*AR2++(IR0) j 


• X(I2) « (R3+R5)*C021 




ADDF 


R4,R2 ; 


• R2 - R2+R4 




MPYF 


*AR4,R2 j 


• R2 ■ R2*C021 


BLK3 


STF 


Rl,*+AR3 


» Y(I3) - -(R4-R2)*C021 


II 


STF R2 , 


*AR3++(IR0) , 


X(I3) - (R4+R2)*C021 



CONT 



CMPI 


§LPCNT,R0 






BPD 


INLOP 




Loop back to the 


LDI 


GRPTCNT, AR7 






LDI 


§IEINDX,AR6 






LSH 


2,AR7 


? 


Increment repeat 






r 


next time 


STI 


AR7 , §RPTCNT 






LSH 


2,AR6 


i 


IE « 4*IE 


STI 


AR6 , @IEINDX 






LDI 


RO , IRO 


9 


Nl « N2 


LSH 


-3,R0 






ADDI 


2,R0 






STI 


R0,8JT 


i 


JT - N2/2+2 


SUBI 


2,R0 






LSH 


1,R0 


} 


N2 - N2/4 


BR 


LOOP 


} 


Next FFT stage 



STORE RESULT USING BIT-REVERSED ADDRESSING 
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END: LDI §FFTSIZ,RC ; RC - N 

SUBI 1,RC ; RC should be one less than desired # 

LDI §FFTSIZ,IRO ? IRO ■ size of FFT » N 

LDI 2 , IR1 

LDI @ INPUT, ARO 

LDP STORE 

LDI § STORE, ARl 



RPTB BITRV 

LDF *+AR0(l),R0 

| | LDF *AR0++ ( IRO ) B , Rl 

BITRV STF R0,*+AR1(1) 

|| STF R1,*AR1++(IR1) 

SELF BR SELF ; Branch to itself at the end 

• end 



The data to be transformed is usually a sequence of real numbers. In this case, 
the FFT demonstrates certain symmetries that permit the reduction of the 
computational load even further. Example 11-38 shows the generic imple- 
mentation of a real-valued, radix-2 FFT. For such an FFT, the total storage re- 
quired for a length-N transform is only N locations; in a complex FFT, 2N are 
necessary. Recovery of the rest of the points is based on the symmetry condi- 
tions. 

Example 11-39 shows the implementation of a radix-2 real inverse FFT. The 
inverse transformation assumes that the input data is given in the order pres- 
ented at the output of the forward transformation and produces a time signal 
in the proper order (that is, bit reversing takes place at the end of the program) . 
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Example 11-38. Real, Radix-2 FFT 



******************************************* 

* FILENAME 

* 

* WRITTEN BY 

* 
* 

* DATE 



f f ft_rl.asm 

Alex Tessarolo 

Texas Instruments, Australia 



* VERSION 



: 23rd July 1991 
: 2.0 



***************************************************************************** 



* 

* VER 

* 1.0 

* 2.0 
* 

* 

* 
* 



DATE 



18th July 91 
23rd July SI 



COMMENTS 



Original release. 
Most stages modified. 

Minimum FFT size increased from 32 to 64. 
Faster in place bit reversing algorithm. 
Program size increased by about 100 words. 
One extra data word required. 



***************************************************************************** 



* SYNOPSIS s 

* 

* 

* 

* 

* 

★ 

* 

* 

* 

* 

* 

★ 

* 

* 

* 

* 



int ffft_rl( FFT__SIZE, LOGJ3IZE, SOURCE_ADDR, DEST_ADDR, 

SINEJTABLE, BIT__RE VERSE ); 



int FFT_SIZE 

int LOGJ3IZE 

float *SOURCE_ADDR 

float *DEST ADDR 



float 
int 



NOTE: 



*SINE_TABLE 
BIT REVERSE 



64, 128, 256, 512, 1024, ... 
6, 7, 8, 9, 10, ... 

Points to location of source data. 

Points to where data will be 

operated on and stored. 

Points to the SIN/COS table. 

* 0, bit reversing is disabled. 

<> 0, bit reversing is enabled. 



1) If SOURCE_ADDR - DEST_ADDR, then in-place bit 
reversing is performed, if enabled (more 
processor intensive). 

2) FFT_SIZE must be >= 64 (this is not checked). 
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* DESCRIPTION: Generic function to do a radix-2 FFT computation on the C30. 

* The data array is FFT_SIZE-long with only real data. The out- 

* put is stored in the same locations with real and imaginary 
points R and I as follows: 



DEST__ADDR[ 0 ] 



R(0) 
R(l) 
R(2) 
R(3) 



R(FFT__SIZE/2) 
I(FFT_SIZE/2 - 1) 



DEST_ADDR[FFT_SIZE - 1] 



1(2) 
1(1) 



The program is based on the FORTRAN program in the 
paper by Sorensen et al., June 1987 issue of Trans, 
on ASSP. 

Bit reversal is optionally implemented at the begin- 
ning of the function. 

The sine/cosine table for the twiddle factors is ex- 
pected to be supplied in the following format: 



SINE_TABLE[0] 



sin(0*2*pi/FFT_SIZE) 
sin(l*2*pi/FFT_SIZE) 



sin( (FFT_SIZE/2-2)*2*pi/FFTJ3IZE) 
SINE_TABLE [FFT_J5IZE/2 - 1]> sin( (FFT_SIZE/2-l ) *2*pi/FFT_SIZE ) 

NOTE: The table is the first half period of a sine wave. 

Stack structure upon call: 



-FP(7) 
-FP(6) 
-FP(5) 
-FP(4) 
-FP(3) 
-FP(2) 
-FP(1) 
-FP(0) 



BIT_REVERSE 

SINE_TABLE 

DEST_ADDR 

SOURCE__ADDR 

LOG_SIZE 

FFT_SIZE 

returne 

addr 

old FP 



***************************************** 
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* NOTE: Calling C program can be compiled using either large 

* or small model. 
* 

* WARNING: DP initialized only once in the program. Be wary 

* with interrupt service routines. Make sure interrupt 

* service routines save the DP pointer. 
* 

* WARNING: The DEST__ADDR must be aligned such that the first 

* LOG_SIZE bits are zero (this is not checked by the 

* program) . 
* 

***************************************************************************** 

* 

* REGISTERS USED: RO, Rl, R2 , R3, R4, R5, R6 , R7 

* ARO, AR1, AR2, AR3 , AR4 , AR5, AR6 , AR7 

* IRO, IR1 

* RC, RS, RE 

* DP 
* 

* MEMORY REQUIREMENTS: Program - 405 Words (approximately) 

* Data « 7 Words 

* Stack » 12 Words 
* 

★*********************** ***************** 

* 

* BENCHMARKS: Assumptions — Program in RAMO 

* — Reserved data in RAMO 

* — Stack on primary/expansion bus RAM 

* — Sine/cosine tables in RAMO 

* — Processing and data destination in RAMI. 

* — Primary/expansion bus RAM, 0 wait state. 
* 

* FFT Size Bit Reversing Data Source Cycles (C30) 



* 1024 OFF RAMI 19816 approx. 

* Note: This number does not include the C callable overheads. 

* Add 57 cycles for these overheads. 
* 

***************************************************************************** 



; Entry execution point. 

; Reserve memory for arguments. 



FP 


. set 


AR3 






• global 


_ffft_rl 




FFTJ3IZE: 


•usect 


".fftdata" 


rl 


LOGJSIZE: 


•usect 


".fftdata" 


rl 


SOURCE_ADDR: 


•usect 


".fftdata" 


rl 


DESTJVDDR: 


•usect 


".fftdata" 


,1 


SINE_TABLE: 


•usect 


".fftdata" 


,1 


BITJREVERSE : 


•usect 


".fftdata" 


rl 


SEPARATION: 


•usect 


".fftdata" 


rl 
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. sect 

ffft_rl: PUSH FP 

LDI SP,FP 

PUSH R4 

PUSH R5 

PUSH R6 

PUSHF R6 

PUSH R7 

PUSHF R7 

PUSH AR4 

PUSH AR5 

PUSH AR6 

PUSH AR7 

PUSH DP 

LDP FFT_SIZE ; Init. DP pointer. 

LDI *— FP(2),R0 ; Move arguments from stack. 

STI RO,@FFT_SIZE 

LDI *-FP(3),R0 

STI R0,@LOG_SIZE 

LDI *-FP(4),R0 

STI RO , @ SOURCE_ADDR 

LDI *-FP(5),R0 

STI RO , @DEST_ADDR 

LDI *-FP(6),R0 

STI RO , @SINE__TABLE 

LDI *-FP(7),R0 

STI RO , @BXT_RE VERSE 

; Check bit reversing mode (on or off). 

• BIT_RE VERSING - 0, then OFF 

; (no bit reversing). 

; BITJREVERSING <> 0, Then ON. 

LDI @BIT_REVERSE,R0 

CMPI 0,R0 

BZ MOVE DATA 



Check bit reversing type. 

If SourceAddr * DestAddr, then in place 

bit reversing. 
If SourceAddr <> DestAddr, then 

standard bit reversing. 



? Initialize C function, 
".ffttext" 

; Preserve C environment. 
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LDI §SOURCE_ADDR,R0 
CMPI §DEST_ADDR,RO 
BEQ IN_PLACE 

; Bit reversing Type 1 (from source to 
; destination) • 

; NOTE: abs ( SOURCE_ADDR - DEST_ADDR) 
; must be > FFT__SIZE, this is not 
; checked. 



LDI §FFT_SIZE,RO 

SUBI 2,R0 

LDI §FFT_SIZE,IRO 

LSH -1,IR0 ; IRO - half FFT size. 

LDI @SOURCE_ADDR,AR0 

LDI §DEST_ADDR, ARl 

LDF *AR0++,R1 

RPTS RO 

LDF *AR0++,R1 

| | STF Rl , *ARl++ ( IRO ) B 

STF Rl, *ARl++(IR0)B 

BR START 



; In-place bit reversing. 



; Bit reversing on even locations, 
; 1st half only. 



IRO = quarter FFT size. 



IN_PLACE: LDI @FFT_SIZE, IRO 

LSH -2, IRO 

LDI 2,IR1 

LDI @FFT_SIZE,RC 

LSH -2,RC 

SUBI 3,RC 

LDI @DEST_ADDR,ARO 

LDI ARO, ARl 

LDI ARO , AR2 



NOP *AR1++(IR0)B 

NOP *AR2++(IR0)B 

LDF *++AR0(IRl) f R0 

LDF *AR1 r Rl 

CMPI ARl, ARO ; Xchange Iocs only if AR0<AR1 . 

LDFGT R0,R1 

LDFGT *AR1++ ( IRO ) B , Rl 
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BITRVl : 



BITRV2 J 



T}T)mn 
txir lo 


DTTDVl 
DilKV 1 




w "r+iiKU [ 1K1 ) , KU 


|| 


STF RO 


LDF 


*AR1,R1 


1 1 


STF Rl 


CMPI 


ARl , ARO 


LDFGT 


R0,R1 


LDFGT 


*AR1++(IR0)B,R0 


STF 


RO, *AR0 


STF 


Rl,*AR2 

? 


T HT 


9 

err x o , KL 


T CH 

lion 




T HT 




ADD I 


RC,AR0 


ADDI 


1,AR0 


LDI 


ARO, ARl 


LDI 


ARO , AR2 


LSH 


-1,RC 


SUBI 


3,RC 




* AD 1 XX / TDfi \ D 


NOP 


*AR2++(IR0)B 


LDF 


*++AR0(IRl) ,R0 


LDF 


*ARl,Rl 


CMPI 


ARl, ARO } 


LDFGT 


RO , Rl 


LDFGT 


*AR1++(IR0)B,R1 




DTrppT/O 
Ol XXxV Z 




*XXADO/TDl \ Dfi 


|| 


STF RO 


LDF 


*AR1,R1 


1 1 


STF Rl 


CMPI 


ARl, ARO 


LDFGT 


R0,R1 


LDFGT 


*AR1++(IR0)B,R0 


STF 


R0,*AR0 


STF 


Rl, *AR2 

? 


T.nT 


9 

13 T3*T3 ir n CT7P DP 


LSH 


-1,RC 


LDI 


RC,IR0 


LDI 


§ DEST_ADDR , ARO 


LDI 


ARO, ARl 


ADDI 


1,AR0 



Perform bit reversing on odd 
locations, 2nd half only. 



Xchange Iocs only if AR0<AR1. 



R1,*AR2++(IR0)B 



Perform bit reversing on odd 
locations, 1st half only. 
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BITRV3 : 



ADDI IR0,AR1 

LSH -1,RC 

LDI RC,IRO 

SUBI 2,RC 

LDF *ARO,RO 

LDF *AR1,R1 

RPTB BITRV3 

LDF *++ARO(IRl),RO 

| | STF RO , *AR1++ ( IRO ) B 

LDF *AR1,R1 

|| STF Rl,*-ARO(IRl) 

STF RO,*ARl 

STF Rl,*ARO 



BR 



START 



Check data source locations. 



If SourceAddr * DestAddr, then 
do nothing. 

If SourceAddr <> DestAddr, then move 
data. 



MOVEJ)ATAs LDI 
CMPI 
BEQ 



8 SOURCE_ADDR , RO 

§DEST_ADDR,RO 

START 



LDI 8FFTJ3IZE,R0 

SUBI 2,R0 

LDI 8SOURCE_ADDR,AR0 

LDI 8 DEST_ADDR , AR 1 

LDF *AR0++,R1 

RPTS RO 

LDF *AR0++,R1 

|| STF R1,*AR1++ 

STF Rl,*ARl 
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Perform first and second FFT loops. 



ARl 


> 


11 


0 


«- 


[X(I1) 


+ X(I2)] + [X(I3) + 


X(I4)] 


AR2 


* 


12 


1 


«- 


[X(I1) 


- X(I2)] 




AR3 


> 


13 


2 


<- 


[X(I1) 


+ X(I2) ] - [X(I3) + 


X(I4)] 


AR4 


> 


14 


3 


<- 


~[X(I3) 


- X(I4)] 




ARl 






4 











START: 



L00P1 2 J 



LDI @DEST_ADDR,AR1 

LDI ARl , AR2 

LDI ARl , AR3 

LDI ARl , AR4 

ADD I 1 , AR2 

ADDI 2 , AR3 

ADD I 3 , AR4 

LDI 4 r IR0 

LDI §FFT_SIZE,RC 

LSH -2,RC 

SUBI 2,RC 

LDF *AR2 , RO 

| | LDF *AR3 ,R1 

ADDF3 Rl,*AR4,R4 

SUBF3 Rl , *AR4++ ( IRO ) , R5 

SUBF3 R0,*AR1,R6 

ADDF3 RO , *ARl++ ( IRO ) , R7 

ADDF3 R7,R4,R2 

SUBF3 R4,R7,R3 

RPTB LOOPl_2 

LDF *+AR2(IR0) ,R0 

LDF *+AR3(IR0) ,R1 

ADDF3 R1,*AR4,R4 

STF R3, *AR3++(IR0) 

SUBF3 Rl f *AR4++(IR0) ,R5 

STF R5 , *-AR4 ( IRO ) 

SUBF3 RO, *AR1,R6 

STF R6 f *AR2++(IR0) 

ADDF3 RO , *AR1++ ( IRO ) , R7 

STF R2 , *-ARl ( IRO ) 

ADDF3 R7,R4,R2 

SUBF3 R4 f R7,R3 

STF R3 , *AR3 

STF R5 , *-AR4 ( IRO ) 

STF R6 , *AR2 

STF R2 , *-ARl ( IRO ) 



RO 
Rl 
R4 
R5 
R6 
R7 
R2 
R3 



X(I2) 
X(I3) 
X(I3) + 



X(I4) 



-[X(I3) - X(I4)] 
X(I1) - X(I2) 



X(I1) + 
R7 + R4- 
R7 - R4- 



X(I2) 



X(I3) <~ 



X(I4) <~ 



X(I2) <- 



X(I1) + 
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Perform third FFT loop. 
Part A: 



AR1 



AR2 > 



AR3 



AR1 



LOOP 3 A: 



II 



12 



13 



14 



II 



II 



X(I1) + X(I3) 



X(I1) - X(I3) 
-X(I4) 



▼ 




LDI 


@DEST_ADDR, ARl 


LDI 


ARl , AR2 


LDI 


ARl , AR3 


ADD I 


4 , AR2 


ADD I 


6, AR3 


LDI 


8 f IRO 


LDI 


@FFT_SIZE,RC 


LSH 


-3,RC 


SUBI 


2,RC 


SUBF3 


*AR2 , *ARl ,Rl 


ADDF3 


*AR2 r *ARl ,R2 


NEGF 


*AR3,R3 


RPTB 


LOOP3_A 


LDF 


*+AR2(IR0) ,R0 


STF 


R2,*ARl++(IR0) 


SUBF3 


R0,*AR1,R1 


STF 


Rl,*AR2++(IR0) 


ADDF3 


RO , *ARl , R2 


STF 


R3, *AR3++(IR0) 


NEGF 


*AR3,R3 


STF 


R2 , *AR1 


STF 


Rl, *AR2 


STF 


R3,*AR3 



; RO « X(I3) 

Rl = X(I1) - X(I3)- 

R2 = X(I1) + X(I3)- 

R3 = ~X(I4)- 

X(I1)<« 

X(I3)^ 

X(I4)« 
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Part B: 









0 








ARO 




11 


1 


4- 


X[I1] 


+ [X(I3)*COS+ X(I4)*C0S] 








2 




AR1 




12 


3 


«- 


X[I1] 


- [X(I3)*COS+ X(I4)*C0S] 








4 




AR2 


> 


13 


5 


<- 


-X[I2] 


- [X(I3)*COS- X(I4)*C0S] 








6 




AR3 




14 


7 


«- 


X[I2] 


- [X(I3)*C0S- X(I4)*C0S] 








8 






ARO 


> 




9 




NOTE: 


COS(2*pi/8) - SIN(2*pi/8) 



II 



I 



LDI 

LSH 

LDI 

SUBI 

LDI 

LDI 

LDI 

LDI 

LDI 

ADD I 

ADDI 

ADDI 

ADDI 

LDI 

LDF 

MPYF3 

MPYF3 

ADDF3 

MPYF3 

SUBF3 

SUBF3 

ADDF3 

STF 

SUBF3 

STF 

ADDF3 

STF 



RPTB 
MPYF3 
| STF 
ADDF3 
MPYF3 



§FFTJSIZE,RC 

-3,RC 

RC,IR1 

3,RC 

8,IR0 

@ DEST_ADDR , ARO 

ARO , ARl 

ARO , AR2 

AR0,AR3 

1 , ARO 

3, ARl 

5,AR2 

7 , AR3 

§SINE_TABLE,AR7 
*++AR7(IRl) ,R7 

*AR7,*AR2,R0 

*AR3,R7,R1 

R0,Rl,R2 

*AR7,*+AR2(IR0) ,R0 

R0,R1,R3 

*AR1,R3,R4 

*ARl,R3,R4 

R4,*AR2++(IR0) 

R2,*AR0,R4 

R4,*AR3++(IR0) 

*AR0,R2,R4 

R4,*AR1++(IR0) 

L00P3_B 
*AR3,R7,R1 
R4,*AR0++(IR0) 
R0,R1,R2 

*AR7 , *+AR2 ( IRO ) , RO 



Initialize table pointers. 

R7 - COS(2*pi/8) 

*AR7 - COS(2*pi/8) 

RO - X(I3)*COS 

R5 - X(I4)*COS 

R2 - [X(I3)*C0S + X(I4)*COS] 

R3 * -[X(I3)*C0S - X(I4)*COS] 
R4 - -X(I2) + R3 " 
R4 -'■ X(I2) + R3 - 

X(I3)« 

R4 m X(I1) - R2 " 



X(I4)« 

R4 - X(I'l) + R2 
X(I2)* 



X(I1)«- 
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SUBF3 


R0,R1,R3 


SUBF3 


*AR1,R3,R4 


ADDF3 


*AR1,R3,R4 


STF 


R4, *AR2++(IR0) 


SUBF3 


R2,*AR0,R4 


STF 


R4,*AR3++(IR0) 


ADDF3 


*AR0,R2,R4 


STF 


R4, *AR1++(IR0) 


MPYF3 


*AR3,R7,R1 


STF 


R4,*AR0++(IR0) 


ADDF3 


R0,R1,R2 


SUBF3 


R0,Rl,R3 


SUBF3 


*AR1,R3,R4 


ADDF3 


*AR1,R3,R4 


STF 


R4 , *AR2 


SUBF3 


R2, *AR0,R4 


STF 


R4 ,*AR3 


txuuc <J 


*AR0,R2,R4 


STF 


R4 , *AR1 


STF 


R4 , *AR0 
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Perform fourth FFT loop. 
Part A: 



AR1 



AR2 



AR3 



AR1 



LOOP 4 A: 



II 



12 



13 



14 



15 



0 
1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 



X(I1) + X(I3) 



X(I1) - X(I3) 



+ -X(I4) 



LDI 


@DEST_ADDR,AR1 


LDI 


ARl , AR2 


LDI 


ARl , AR3 


ADD I 


8,AR2 


ADD I 


12,AR3 


LDI 


16,IR0 


LDI 


§FFTJ3IZ£,RC 


LSH 


-4,RC 


SUBI 


2,RC 


SUBF3 


*AR2,*AR1,R1 


ADDF3 


*AR2,*AR1,R2 


NEGF 


*AR3,R3 


RPTB 


LOOP4__A 


LDF 


*+AR2(IR0),R0 


| | STF 


R2, *AR1++(IR0) 


SUBF3 


R0,*AR1,R1 


| | STF 


R1,*AR2++(IR0) 


ADDF3 


R0 , *AR1 ,R2 


| | STF 


R3,*AR3++(IR0) 


NEGF 


*AR3,R3 


STF 


R2,*AR1 


| | STF 


Rl, *AR2 


STF 


R3,*AR3 



; R0 
Rl 
R2 
R3 



X(I3) 

X(I1) - X(I3)- 
X(I1) + X(I3)- 
-X(I4) . 



X(I1)4- 
X(I3)4- 
X(I4)«- 
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Part Bt 









0 










ARO 




11 


(3rd) 


1 


4r 


X[I1] + 


[X(I3)*COS+ 


X(I4)*SIN] 






11 


(2nd) 


2 




• 










11 


(1st) 


3 




• 












4 














12 


(1st) 


5 














12 


(2nd) 


6 




• 






AR1 




12 


(3rd) 


7 


4r 


X[I1] — 


[X(I3)*COS+ 


X(I4)*SIN] 








8 










AR2 




13 


(3rd) 


9 


4- 


-X[I2] - 


[X(I3)*COS- 


X(I4)*COS] 






13 


(2nd) 


10 










AR4 




13 


(1st) 


11 
















12 














14 


(1st) 


13 














«r A 
it 




14 










AR3 




14 


(3rd) 


15 


4- 


X[I2] - 


[X(I3)*SIN- 


X(I4)*COS] 








16 










ARO 






17 
















i 

▼ 













LDI 


@FFT_SIZE,RC 


LSH 


-4,RC 


LDI 


RC,IR1 


LDI 


2,IR0 


SUBI 


3,RC 


LDI 


@ DEST_ADDR , ARO 


LDI 


ARO, AR1 


LDI 


ARO , AR2 


LDI 


ARO , AR3 


LDI 


ARO , AR4 


ADDI 


1,AR0 


ADDI 


7,ARl 


ADDI 


9,AR2 


ADDI 


15,AR3 


ADDI 


11,AR4 


LDI 


@ S INE_JTABLE , AR7 


LDF 


*++AR7(IRl),R7 


LDI 


AR7 , AR6 


LDF 


*++AR6(IRl) ,R6 


LDI 


AR6,AR5 


LDF 


*++AR5(IRl),R5 



R7 * SIN('l*[2*pi/16]) 
*AR7 - COS(3*[2*pi/16] ) 

R6 - SIN(2*[2*pi/16] ) 
*AR6 - COS(2*[2*pi/16] ) 

R5 - SIN(3*[2*pi/16] ) 
*AR5 - COS(l*[2*pi/16] ) 



LDI 



16,IR1 
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MPYF3 *AR7 , *AR4 , RO 

MPYF3 *++AR2 ( IRO ) , R5 , R4 

MPYF3 * AR3 ( IRO ) , R5 , Rl 

MPYF3 *AR7,*AR3,R0 

| | ADDF3 R0,R1,R2 

MPYF3 *AR6,*-AR4,R0 

| | SUBF3 R4,R0,R3 

SUBF3 * AR1 ( IRO ) , R3 , R4 

ADDF3 *AR1,R3,R4 

STF R4 , *AR2 

SUBF3 R2 , *++ARO ( IRO ) , R4 

STF R4 , *AR3 

ADDF3 *AR0,R2,R4 

STF R4 , *AR1 

MPYF3 *++AR3,R6,Rl 

STF R4 , *ARO 

ADDF3 R0,R1,R2 

MPYF3 * AR5 , *-AR4 ( IRO ) , RO 

SUBF3 R0,R1,R3 

SUBF3 *++ARl,R3,R4 

ADDF3 *ARl,R3,R4 

STF R4 , *AR2 

SUBF3 R2,* AR0,R4 

STF 

STF R4 , *AR1 

MPYF3 * AR2,R7,R4 

STF R4 , *ARO 

MPYF3 *++AR3 f R7 f Rl 

MPYF3 *AR5 / *AR3 / R0 

ADDF3 RO ,Rl , R2 

MPYF3 *AR7 , *++AR4 ( IRl ) , RO 

SUBF3 R4 ,RO ,R3 

SUBF3 *++ARl,R3,R4 

ADDF3 *ARl,R3 f R4 

STF R4 f *AR2++(IR1) 

SUBF3 R2,* — ARO f R4 

STF R4 / *AR3++(IR1) 

ADDF3 *AR0,R2 f R4 

STF R4 f *AR1++(IR1) 

RPTB LOOP4_B 

MPYF3 *++AR2 ( IRO ) , R5 , R4 

STF R4,*AR0++(IR1) 

MPYF3 * AR3 ( IRO ) , R5 , Rl 

MPYF3 *AR7,*AR3,R0 

ADDF3 R0,R1,R2 

MPYF3 *AR6 , *-AR4 , RO 

SUBF3 R4,R0,R3 

SUBF3 * AR1 ( IRO ) , R3 , R4 

ADDF3 *AR1,R3,R4 



X(I3)*COS(3) 

X(I3)*SIN(3) 
X(I4)*SIN(3) 
X(I4)*COS(3) 
[X(I3)*C0S + X(I4)*SIN] 

-[X(I3)*SIN - X(I4)*COS] 
-X(I2) + R3- 
X(I2) + R3" 



RO = 

R4 = 
Rl = 
RO = 
R2 = 

R3 - 

R4 = 
R4 = 

X(I3)«« 

R4 = X(I1) - R2- 

X(I4)^ 

R4 = X(I1) + R2- 
X(I2)^ 



X(I1)<- 
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| | STF 


R4 , *AR2 — 




SUBF3 


R2,*++AR0(IR0),R4 




| | STF 


R4 , *AR3 




ADDF3 


*AR0,R2,R4 




| | STF 


R4,*AR1 




MPYF3 


*++AR3,R6,Rl 




| | STF 


R4 , *ARO 




ADDF3 


R0,R1,R2 




MPYF3 


*AR5 , *— AR4 ( IRO ) , RO 




| | SUBF3 


R0,R1,R3 




SUBF3 


*++ARl,R3,R4 




ADDF3 


*ARl,R3,R4 




| | STF 


R4 , *AR2 




SUBF3 


R2,* — AR0,R4 




1 1 STF 


R4,-*AR3 




ADDF3 


*AR0,R2,R4 




| | STF 


R4 , *AR1 




MPYF3 


* — AR2 , R7 , R4 




| | STF 


R4 , *ARO 




MPYF3 


*++AR3,R7,Rl 




MPYF3 


*AR5 , *AR3 ,RO 




| | ADDF3 


R0,R1,R2 


MPYF3 


*AR7 , *++AR4 ( IR1 ) , RO 




| | SUBF3 


R4,R0,R3 




SUBF3 


*++ARl,R3,R4 




ADDF3 


*ARl,R3,R4 




| | STF 


R4,*AR2++(IRl) 




SUBF3 


R2,* AR0,R4 




| | STF 


R4,*AR3++(IRl) 


L00P4_B: 


ADDF3 


*AR0,R2,R4 




| | STF 


R4,*ARl++(IRl) 




MPYF3 


*++AR2(IR0) ,R5,R4 




| | STF 


R4,*AR0++(IR1) 




MPYF3 


* AR3(IR0) ,R5,R1 




MPYF3 


*AR7 f *AR3 ,RO 




| | ADDF3 


R0,R1,R2 




MPYF3 


*AR6,*-AR4,R0 




| | SUBF3 


R4,R0,R3 




SUBF3 


* ARl(IRO) ,R3,R4 




ADDF3 


*ARl,R3,R4 




| | STF 


R4 , *AR2 




SUBF3 


R2,*++AR0(IR0),R4 




| | STF 


R4 , *AR3 




ADDF3 


*AR0,R2,R4 




| | STF 


R4 , *AR1 
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MPYF3 *++AR3,R6,Rl 

STF R4 , *ARO 

ADDF3 R0,R1,R2 

MPYF3 *AR5 , *-AR4 ( IRO ) , RO 

SUBF3 R0,R1,R3 

SUBF3 *++ARl,R3,R4 

ADDF3 *ARl,R3,R4 

STF R4 , *AR2 

SUBF3 R2,* — AR0,R4 

STF R4 , *AR3 

ADDF3 *AR0,R2,R4 

STF R4 , *AR1 

MPYF3 * — AR2,R7,R4 

STF R4 , *ARO 

MPYF3 *++AR3,R7,Rl 

MPYF3 *AR5,*AR3,R0 

ADDF3 R0,R1,R2 

SUBF3 R4,R0,R3 

SUBF3 *++ARl,R3,R4 

ADDF3 *AR1,R3,R4 

STF R4 , *AR2 

SUBF3 R2,* — AR0,R4 

STF R4 , *AR3 

ADDF3 *AR0,R2,R4 

STF R4 , *AR1 

STF R4,*AR0 
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Perform remaining FFT loops ( loop 4 onwards ) , 

LOOP 
1st 2nd 



AR1> 

A > 
B * 

AR2> 
AR3> 

C > 
D > 

AR4 _^ 

ARl^ 









X'(H) 


0 


0 


X(I1) (1st) 


1 


1 


X(I1) (2nd) 


2 


2 


X(I1) (3rd) 


3 


3 


• 






X' (12) 


8 


16 








v / 12 ) (3rd) 


1 1 


29 


X(I2) (2nd) 


14 


30 


X(I2) (1st) 


15 


31 


X'(I3) 


16 


32 


X(I3) (1st) 


17 


33 


X(I3) (2nd) 


18 


34 


X(I3) (3rd) 


19 


35 


• 






X'(I4) 


24 


48 








X(I4) (3rd) 


29 


61 


X(I4) (2nd) 


30 


62 


X(I4) (1st) 


31 


63 




32 


64 




33 


65 









X' (11) + 
X(I1) + 



X'(I3) 

[X(I3)*COS + 



X(I4)*SIN] 



X[I1] - [X(I3)*C0S + 
X' (II)- X' (13) 
-X[I2]- [X(I3)*SIN- 



X(I4)*SIN] 
X(I4)*C0S] 



-X'(I4) 



X[I2] - [X(I3)*SIN- X(I4)*C0S] 



LOOPS 



LDI 


@FFT_SIZE, IRO 


LSH 


-2 , IRO 


STI 


IRO t @ SEPARATION 


LSH 


-2 , IRO 


LDI 


5,R5 


LDI 


3,R7 


LDI 


16, R6 


LDI 


@ DE S T__ADDR , AR 5 


LDI 


@DEST_ADDR, ARl 


LSH 


-1,IR0 


LSH 


1,R7 


ADDI 


1,R7 


LSH 


1,R6 


LDI 


AR1,AR4 
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INLOP: 



ADD I 

LDI 

ADDI 

ADDI 

SUBI 

LDI 

SUBI 

LDI 
LDI 
LDI 

ADDF3 
SUBF3 
NEGF 

| | STF 
STF 

| | STF 

LDI 



IN BLK: 



MPYF3 

ADDF3 

SUBF3 

ADDF3 

STF 

SUBF3 

STF 

ADDF3 

STF 

RPTB 

LDF 

MPYF3 

STF 

MPYF3 

MPYF3 

SUBF3 

MPYF3 

ADDF3 

SUBF3 

ADDF3 

STF 

SUBF3 

STF 

ADDF3 

STF 



R7,AR1 
ARl , AR2 
2,AR2 
R6,AR4 
R7, AR4 
AR4 , AR3 
2,AR3 

§ S INE_TABLE , ARO 

R7,IR1 

R7,RC 



; ARl points at A. 

; AR2 points at B. 

; AR4 points at D. 

; AR3 points at C. 

; ARO points at SIN/COS table. 



* ARl ( IR1 ) , *++AR2 ( IR1 ) , RO 

* AR3 ( IR1 ) , *AR1++ , Rl 

* AR4,R2 

RO, *-ARl 
Rl , *AR2 — 
R2, *AR4++(IR1) 



RO - 
Rl = 
R2 - 
X'(I1) 
X'(I3) 
X'(I4) 



X'(I1) + X' 
X' (11) - X' 
-X'(I4) 
<4 


(13)— i 
(I3)n 






« 

< 1 





@ SEPARATION, IRl 



; IRl=SEPARATION 
BETWEEN SIN/COS TBLS 



SUBI 


3,RC 






MPYF3 


*++AR0(IR0),*AR4,R4 


; R4 


= X(I4)*SIN 


MPYF3 


*AR0,*++AR3,R1 


; Rl 


- X(I3)*SIN 


MPYF3 


*++AR0 ( IRl ) , *AR4 , RO 


; RO 


- X(I4)*COS 


MPYF3 


*AR0,*AR3,R0 


; RO 


= X(I3)*COS 


SUBF3 


R1,R0,R3 


; R3 


- -[X(I3)*SIN - X(I4)*COS] 



*++AR0 ( IRO ) , *-AR4 , RO 

R0,R4,R2 

*AR2,R3,R4 

*AR2,R3,R4 

R4 , *AR3++ 

R2,*AR1,R4 

R4 , *AR4 — 

*AR1,R2,R4 

R4 , *AR2 — 

INJ3LK 

*-AR0(IRl) ,R3 

*AR4,R3,R4 

R4,*AR1++ 

*AR3,R3,R1 

*AR0,*AR3,R0 

R1,R0,R3 

*++AR0 ( IRO ) , *-AR4 , RO 

R0,R4,R2 

*AR2,R3,R4 

*AR2,R3,R4 

R4,*AR3++ 

R2,*AR1,R4 

R4 , *AR4 — 

*AR1,R2 ,R4 

R4 , *AR2 — 



X(I3)*C0S + X(I4)*SIN 

R3 - X(I2) 

R3 + X(I2) 



R2 = 
R4 « 
R4 « 

X(I3) 4 

R4 * X(I1) - R2 

X(I4) < 

R4 - X(I1) + R2 
X(I2) < 



X(I1) <- 
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LDF 


*-ARO(IRl) ,R3 




MPYF3 


*AR4,R3,R4 




II 


STF 


R4,*AR1++ 


MPYF3 


*AR3 ,R3,R1 




MPYF3 


*AR0,*AR3,R0 




II 


SUBF3 


R1,R0,R3 


LDI 


R6, IR1 




ADDF3 


R0,R4,R2 




SUBF3 


*AR2,R3,R4 




ADDF3 


*AR2 R3 R4 




II 


STF 


R4,*AR3++(IRl) 




Rz , *AK1 |R4 




II 


STF 


R4,*AR4++(IR1) 


ADDF3 


*ARl,R2,R4 




II 


STF 


R4,*AR2++(IR1) 


STF 


R4, *AR1++(IR1) 


i 


SUBI3 


AR5,AR1,R0 




CMP I 


§FFTJ3IZE,R0 




BLTD 


INLOP 


? LOOP BACK TO 



INNER LOOP 

LDI §SINE_TABLE,ARO ; ARO POINTS TO 

SIN/COS TABLE 

LDI R7,IR1 
LDI R7,RC 

ADD I 1,R5 

CMPI GLOG_SIZE,R5 

BLED LOOP 

LDI @DEST__ADDR, AR1 

LSH -1,IR0 

LSH 1,R7 
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; Return to C environment. 

POP DP ; Restore C environment 

; variables . 



POP 


AR7 


POP 


AR6 


POP 


AR5 


POP 


AR4 


POPF 


R7 


POP 


R7 


POPF 


R6 


POP 


R6 


POP 


R5 


POP 


R4 


POP 


FP 


RETS 





• end 



* 

* No more. 

******************************************************** 
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Example 11-39. Real Inverse, Radix-2 FFT 

* Real Inverse FFT 

***************************************************************************** 

* FILENAME s ifft_rl.asm 

* 

* WRITTEN BY i Daniel Mazzocco 

* Texas Instruments, Houston 
* 

* DATE s 18th Feb 1992 
* 

* VERSION : 1.0 

* 



********************************************* 

* VER DATE COMMENTS 



* 1,0 18th Feb 92 Original release. Started from forward real FFT 

* routine written by Alex Tessarolo, rev 2.0 . 



***************************************************************************** 



* SYNOPSIS: 



int 



int 
int 
float 

float 
float 
int 



NOTEs 



ifft_rl( FFT_SIZE, LOG_SIZE, SOURCE_ADDR , 

DEST_ADDR, SINE_TABLE, BIT__REVERSE ); 



FFT_SIZE 
LOGJ3IZE 
*SOURCE_ADDR 

*DEST_ADDR 
*SINE_TABLE 
BIT REVERSE 



64, 128, 256, 512, 1024, ... 

6, 7, 8, 9, 10, ... 
Points to where data is originated 
and operated on. 

Points to where data will be stored. 
Points to the SIN/COS table. 
= 0, bit reversing is disabled. 
<> 0, bit reversing is enabled. 



1) If SOURCE_ADDR = DEST_ADDR, then in place bit 
reversing is performed, if enabled (more 
processor intensive). 

2) FFT__SIZE must be >= 64 (this is not checked). 
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DESCRIPTION: Generic function to do an inverse radix-2 FFT computation 
on the C30. 

The data array is FFT__SIZE long with real and imaginary 
points R and I as follows: 



SOURCE_ADDR[0] 



R(0) 
R(l) 
R(2) 
R(3) 



R(FFT_SIZE/2) 
I(FFT_SIZE/2 - 1) 



1(2) 

S0URCE_ADDR[FFT_SIZE-1] I ( 1 ) 

The output data array will contain only real values. 
Bit reversal is optionally implemented at the end 
of the function. 

The sine/cosine table for the twiddle factors is expected 
to be supplied in the following format: 



SINE_TABLE[0] 



^ sin(0*2*pi/FFT_SIZE) 
sin(l*2*pi/FFT_SIZE) 



sin ( ( FFT_SIZE/2-2 ) *2 *pi/FFT_SIZE ) 
SINE_TABLE[FFT_SIZE/2-l] ^ sin( (FFT_SIZE/2-l ) *2*pi/FFT_SIZE) 

NOTE: The table is the first half period of a sine wave. 

Stack structure upon call: 



* 


-FP(7) 


BIT_REVERSE 


* 


-FP(6) 


SINEJTABLE 


* 


-FP(5) 


DEST__ADDR 


* 


-FP(4) 


SOURCE_ADDR 


* 


-FP(3) 


LOG__SIZE 


* 


-FP(2) 


FFT_SIZE 


* 


-FP(1) 


returne 


* 


-FP(0) 


addr 


* 




old FP 



********************************************^ 
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NOTE: Calling C program can be compiled using either large 
or small model. 

WARNING: DP initialized only once in the program. Be wary 

with interrupt service routines. Make sure interrupt 
service routines save the DP pointer. 

WARNING: The SOURCE_ADDR must be aligned such that the first 
LOG__SIZE bits are zero (this is not checked by the 
program) . 



***************************************************************************** 

* 

* REGISTERS USED: RO, Rl, R2 , R3 , R4, R5, R6, R7 

* ARQ, AR1, AR2, AR3, AR4 , AR5, AR6, AR7 

* IRO, IR1 

* RC, RS, RE 

* DP 
* 

* MEMORY REQUIREMENTS: Program = 322 words (approximately) 

* Data * 7 words 

* Stack = 12 words 
* 

****************************************** 



* BENCHMARKS: Assumptions - Program in RAMO 



FFT Size 



— Reserved data in RAMO 

— Stack on primary /expansion bus RAM 

— Sine/cosine tables in RAMO 

— Processing and data destination in RAMI 

— Primary /expansion bus RAM, 0 wait state 

Bit Reversing Data Source Cycles (C30) 



1024 OFF RAMI 25892 approx. 

Note: This number does not include the C callable overheads. 
Add 57 cycles for these overheads. 
***************************************************************************** 



FP 


.set 


AR3 








•global 


_ifft_rl 




r 


FFTJSIZE: 


•usect 


".ifftdata", 


1 


} 


LOG_SIZEs 


.usect 


".ifftdata", 


1 




SOURCE_ADDR: 


•usect 


".ifftdata", 


1 




DEST_ADDR: 


.usect 


".ifftdata", 


1 




SINE__TABLE s 


•usect 


".ifftdata", 


1 




BIT_RE VERSE: 


•usect 


".ifftdata", 


1 




SEPARATION: 


•usect 


".ifftdata", 


1 
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Initialize C Function. 



• sect 



".iffttext" 



ifft rl: 



PUSH 
LDI 



FP 

SP,FP 



Preserve C environment. 



PUSH R4 

PUSH R5 

PUSH R6 

PUSHF R6 

PUSH R7 

PUSHF R7 

PUSH AR4 

PUSH AR5 

PUSH AR6 

PUSH AR7 

PUSH DP 

LDP FFT_SIZE ; Initialize DP pointer. 

LDI *-FP(2),R0 ; Move arguments from stack. 

STI RO,@FFT_SIZE 

LDI *-FP(3),R0 

STI R0,@LOGJ3IZE 

LDI *-FP(4),R0 

STI RO , @ SOURCE_ADDR 

LDI *-FP(5),R0 

STI RO,@DEST_ADDR 

LDI *-FP(6),R0 

STI RO,@SINE_TABLE 

LDI *-FP(7),R0 

STI RO , @BIT__RE VERSE 
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Perform last FFT loops first ( loop 2 onwards ) • 



LOOP 
1st 2nd 



AR1 ■ 

A 
B 



AR2 ■ 
AR3- 

C ■ 
D ■ 

AR4 • 
ARl 



x'(ii) 



X(I1) (1st) 



X(I1) (2nd) 



X(I1) (3rd) 



X' (12) 



X(I2) (3rd) 



X(I2) (2n5T 
X(I2) (1st) 



X' (13) 



TTsTT 



X(I3) (3rd) 



X' (14) 



X(I4) (3rd) 



X(I4) (2nd) 



X(I4) (1st) 



0 
1 
2 
3 



13 
14 
15 
16 
17 
18 
19 



29 
30 
31 
32 
33 



0 
1 
2 
3 



8 16 



29 
30 
31 
32 
33 
34 
35 



24 48 



61 
62 
63 
64 
65 



X' (11)+ X' (13) 
X(I1) + [X(I2) 



X' (12)* 2 



X[I4] - [X(I3) 
X'(I1)- X'(I3) 

[X(I1)-X(I2) ]*COS-[X(I3)+X(I4) ]*SIN 



-X' (I4)*2 



«- [X(I2)-X(I2) ]*SIN+ [X(I3)+X(I4) ]*COS 



LOOP: 



LDI 1,IR0 

LDI 4,R5 

LDI §FFT_SIZE,R7 

LSH -2,R7 

SUBI 1,R7 

LDI §FFTJ3IZE,R6 

LSH 1,R6 

LDI @ SOURCE_ADDR , AR5 

LDI @ SOURCE_ADDR , ARl 

LSH -1,R6 ; 

LDI ARl , AR4 

ADD I R7,AR1 ; 



Step between two consecutive sines 
Stage number from 4 to M. 

R7 is FFT_SIZE/4-l (ie 15 for 64 pts) 
and will be used to point at A & D. 
R6 will be used to point at D. 



R6 is FFT_SIZE at the 1st loop. 
ARl points at A. 



11-112 



Application-Oriented Operations 



INLOP: 



II 



II 



II 
II 
II 

II 



LDI 

ADD I 

ADDI 

SUBI 

LDI 

SUBI 

LDI 
LDI 

ADDF3 

SUBF3 

LDF 

STF 

MPYF 

LDF 

STF 

MPYF 

STF 

STF 

LDI 

LDI 
LSH 
SUBI 

SUBF3 

ADDF3 

MPYF 3 

LDF 

MPYF3 

SUBF3 

ADDF3 

STF 

MPYF3 

STF 

ADDF3 

MPYF 3 

STF 

SUBF3 



ARl , AR2 

2,AR2 

R6,AR4 

R7,AR4 

AR4,AR3 

2,AR3 

R7 , IR1 
R7,RC 



; AR2 points at B. 
; AR4 points at D. 
; AR3 points at C. 



* — ARl(IRl) ,* 
— AR3(IR1),R0 
*AR3, *ARl,Rl 
* — AR4 ,R2 
RO, *AR1++ 
-2.0,R2 
* — AR2,R3 
Rl,*AR3++ 
2.0,R3 

R3,*AR2++(IRl) 
R2, *AR4++(IR1) 

§FFT_SIZE,IR1 

§ S INE_TABLE , ARO 

-2 , IR1 

3,RC 



RO * X'(I1) + X'(I3) 
Rl ■ X' (II) - X' (13) -« 



X'(H) < 

R2 - -2*X' (14) 



*AR2 , *AR1 ,R3 
*AR1,*AR2,R2 
R3,*++AR0(IR0) 
* AR4 f R4 

R3,*++AR0(IR1) 

*AR3,R4,R3 

R4,*AR3,R2 

R2,*AR1++ 

R2 # *AR0 — (IR1) 

R3 , * AR2 — 

R4,R1,R3 

R2,*AR0,R1 

R3 , *AR4 — 

Rl f R0 / R4 



X'(I3) 4 

R3 - 2*X'(I2) 

X'(I2) 4 

X'(I4) A 



□ 



IRl^separation between SIN/ 
COS tbls 

ARO points at SIN/COS table. 



; R3 = 
; R2 = 
,R1; Rl 
; R4 = 
,R0; RO 
; R3 - 
; R2 = 
; X(I1) 
,R4; R4 
X(I2) 
R3 = 
Rl = 
X(I4) 
R4 



X(I1)-X(I2) 

X(I1)+X(I2) 

* R3*SIN 

X(I4) 

- R3*COS 

X(I4)-X(I3) 

X(I3)+X(I4) 



R2*COS 

< 

R3*SIN + R2*COS 
R2*SIN 
< 



R3*COS - R2*SIN 



RPTB 



IN BLK 



Software Applications 11-113 



Application-Oriented Operations 



IN BLK: 





SUBF3 




ADDF3 




MPYF3 


1 1 

1 1 


STF 




LDF 




MPYF3 


1 1 

1 1 


SUBF3 




ADDF3 


1 1 

1 1 


STF 




MPYF3 


II 


STF 




ADDF3 




MPYF3 


1 1 


STF 




SUBF3 




SUBF3 




ADDF3 




MPYF3 


1 1 

1 1 


STF 




LDF 




MPYF3 


1 1 

1 1 


SUBF3 




ADDF3 


1 1 

1 1 


STF 




MPYF3 


1 1 

1 1 


STF 




LDI 




ADDF3 




MPYF3 


1 1 


STF 




SUBF3 




NEGF 


1 1 


STF 




SUB 1 3 




CMP I 




BLTD 




NOP 




LDI 




LDI 




ADD I 




CMPI 




BLED 




LDI 




LSH 




LSH 



*AR2,*AR1,R3 
*ARl , *AR2 ,R2 
R3, *++AR0(IR0) 
R4,*AR3++ 
*AR4,R4 

R3,*++AR0(IRl) 

*AR3,R4,R3 

R4,*AR3,R2 

R2,*AR1++ 

R2,*AR0 — (IR1) 

R3 , * AR2 — 

R4,R1,R3 

R2,*AR0,Rl 

R3 , *AR4 — 

R1,R0,R4 

*AR2,*AR1,R3 
*AR1,*AR2,R2 
R3,*++AR0(IR0) 
R4 , *AR3++ 
*AR4,R4 

R3,*++AR0(IR1) 
*AR3,R4,R3 
R4,*AR3,R2 
R2,*AR1 

R2 , *AR0— ( IRl ) 

R3,*AR2 

R6,IR1 

R4,R1,R3 

R2,*AR0,R1 

R3,*AR4++(IRl) 

Rl,R0,R4 

*ARl++(IRl),R2 

R4,*AR3++(IR1) 

AR5,AR1,R0 

@FFT_SIZE,RO 

INLOP 

*AR2++(IRl) 

R7,IR1 

R7,RC 



; R3 - 

? R2 = 
,R1; Rl 
} X(I3) 
; R4 - 
,R0; RO 
; R3 - 
? R2 « 
J X(I1) 
,R4; R4 
X(I2) 
R3 - 
Rl « 
X(I4) 
R4 = 



X(I1)-X(I2) 
X(I1)+X(I2)- 
- R3*SIN 

X(I4) 

« R3*COS 
X(I4)-X(I3)- 
X(I3)+X(I4) 
< 



R2*COS 
« 



R3*SIN + R2*COS- 
R2*SIN 



R3*COS - R2*SIN 



R3 ■ X(I1)-X(I2) 
R2 ■ X(I1)+X(I2)- 
,R1; Rl - R3*SIN 
; X(I3) 
; R4 - X(I4) 
,R0; RO - R3*COS 
? R3 - X(I4)-X(I3)- 
; R2 « X(I3)+X(I4) 

; X(I1) 4 

,R4; R4 * R2*COS 

X(I2) « 



Get prepared for the next 

R3 * R3*SIN + R2*COS 

Rl ■ R2*SIN 

X(I4) « 

R4 = R3*COS - R2*SIN 

Dummy 

X(I3) 



Loop back to the inner loop 
Dummy 



1,R5 

@LOG_SIZE,R5 ; Next stage if any left 
LOOP 

§ SOURCE_ADDR ,ARl 

1,IR0 ; Double step in sinus table 

-1,R7 
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; Perform third FFT loop. 



? Part A: 
ARl 

AR2 

AR3 

AR3 



II I 0 ♦ X (II) + X(I3) 

1 

12 12 4- 2 * X(I2) 

3 

13 14 4- X (II) - X(I3) 

5 

14 |6 4- -2 * X(I4) 

7 



ARl 



LDI GS0URCE_ADDR,AR1 

LDI ARl , AR2 

LDI ARl , AR3 

LDI ARl , AR4 

ADDI 2,AR2 

ADD I 4,AR3 

ADDI 6,AR4 

LDI 8 , IRO 

LDI @FFT_SIZE,RC 

LSH -3,RC 

SUBI 1,RC 

LDI @ S INE_JTABLE , ARO ; ARO points at SIN/COS table. 



LOOP3 A: 



II 



II 



RPTB 


LOOP3_A 


LDF 


*AR3,R3 


ADDF3 


R3,*AR1,R0 


SUBF3 


R3,*AR1,R1 


LDF 


*AR4,R2 


STF 


R0,*ARl++(IR0) 


MPYF 


-2.0,R2 


LDF 


* AR2 , R3 


STF 


Rl,*AR3++(IR0) 


MPYF 


2.0,R3 


STF 


R3, *AR2++(IR0) 


STF 


R2, *AR4++(IR0) 



R0 = X'(I1) + X'(I3)- 
Rl = X'(H) - X'(I3)- 



X' (Il)« 

R2 - -2*X'(I4) 



X' (I3)« 

R3 - 2*X'(I2)- 

X' (12)4 

X' (14)4- 
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Part Bs 









0 






AR1 




11 


1 




X(I1) + X(I2) 








2 




AR2 




12 


3 




X(I1) - X(I3) 








4 




AR3 


> 


13 


5 




[X(I1)- X(I2)]*C0S- [X(I3)+ X(I4)]*SIN 








6 




AR4 




14 


7 




[X(I1)~ X(I2)]*SIN+ [X(I3)+ X(I4)]*C0S] 








8 




ARl 






9 




NOTE: COS(2*pi/8) - SIN(2*pi/8) 






i 

V 







LDI @ SOURCE_ADDR , AR 1 

LDI ARl , AR2 

LDI AR1,AR3 

LDI ARl , AR4 

ADDI 1, ARl 

ADDI 3, AR2 

ADDI 5,AR3 

ADDI 7,AR4 

LDI § S I NESTABLE , AR 7 

LDI @FFT_SIZE,RC 

LSH -3,RC 

LDI RC,IR1 

SUBI 2,RC 



; AR7 points at SIN/COS table. 
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LDF *AR2,R6 

LDF *AR3,R0 

ADDF3 R6,*AR1,R5 

SUBF3 R6,*ARl,R4 

SUBF3 R0,R4,R3 

ADDF3 RO ,R4 ,R2 

SUBF3 RO , *AR4 , Rl 

| | STF R5,*ARl++(IR0) 

ADDF3 R2,*AR4,R5 

| | STF Rl, *AR2++(IR0) 

MPYF3 R5, *++AR7(IRl) f Rl 

| | SUBF3 *AR4,R3,R2 

MPYF3 R2 , *AR7 ,RO 

| | STF Rl, *AR4++(IR0) 

RPTB LOOP3 B 



LOOP3 B: 





LDF 


*AR2,R6 


II 


STF 


R0,*AR3++(IR0) 




ADDF3 


R6, *AR1,R5 




LDF 


*AR3,R0 




SUBF3 


R6, *ARl,R4 




SUBF3 


R0,R4,R3 




ADDF3 


RO ,R4 ,R2 




SUBF3 


RO , *AR4 ,R1 


II 


STF 


R5,*ARl++(IR0) 




ADDF3 


R2 , *AR4 ,R5 


II 


STF 


Rl,*AR2++(IR0) 




MPYF3 


R5 , *AR7 f Rl 


II 


SUBF3 


*AR4,R3,R2 




MPYF3 


R2 , *AR7 f RO 


II 


STF 


R1 / *AR4++(IR0) 




STF 


RO f *AR3 



R6 = 
RO = 
R5 = 
R4 = 
R3 = 
R2 = 
Rl = 
X(I1 
R5 = 
X(I2 
Rl = 
R2 = 
RO = 
X(I4 



X(I2) 

X(I3) 

X(I1)+X(I2) 
X(I1)-X(I2) 
X(I1)-X(I2)-X(I3) 
X(I1)-X(I2)+X(I3) 
X(I4)-X(I3) 

) « 

X(I1)-X(I2)+X(I3)+X(I4) 

) * 



R5*SIN 

X(I1)-X(I2)-X(I3)-X(I4) 

R2*SIN 

) < 



R6 = 
X(I3 
R5 = 
RO = 
R4 = 
R3 = 
R2 = 
Rl = 
X(I1 
R5 = 
X(I2 
Rl = 
R2 = 
RO = 
X(I4 



X(I2) 
) < 



X(I1)+X(I2) 

X(I3) 

X(I1)-X(I2) 
X(I1)-X(I2)-X(I3) 
X(I1)-X(I2)+X(I3) 
X(I4)-X(I3) 

) < 

X(I1)-X(I2)+X(I3)+X(I4) 

) < 

R5*SIN^ 



X(I1)-X(I2)-X(I3)-X(I4) 
R2*SIN 

) « 



X(I3) 



Software Applications 11-117 



Application-Oriented Operations 



Perform first and second FFT loops. 



AR1 


* 


11 


0 


AR2 




12 


1 


AR3 




13 


2 


AR4 




14 


3 


AR1 






4 






i 
i 

▼ 





X(I1) 
X(I1) 
X(I1) 



X(I3) + 
X(I3) - 
X(I3) - 



X(I1) - X(I3) + 



2*X(I2) 
2*X(I2) 
2*X(I4) 
2*X(I4) 



LDI 


§SOURCE_ADDR, 


LDI 


ARl , AR2 


LDI 


ARl , AR3 


LDI 


ARl , AR4 


ADDI 


1 ,AR2 


ADDI 


2,AR3 


ADDI 


3,AR4 


LDI 


4,IR0 


LDI 


8FFT_SIZE,RC 


LSH 


-2,RC 


SUBI 


2,RC 
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II 



L00P1 2 J 



LDF 


*AR4 ,R6 


LDF 


*AR2,R7 


LDF 


*ARl,Rl 


MPYF 


2.0,R6 


MPYF 


2.0,R7 


SUBF3 


R6 , *AR3 ,R5 


SUBF3 


R5,Rl,R4 


SUBF3 


R7, *AR3,R5 


STF 


R4, *AR4++(IR0) 


ADDF3 


R5,Rl,R3 


ADDF3 


R6,*AR3,R4 


STF 


R3,*AR2++(IR0) 


SUBF3 


R4,Rl,R4 


ADDF3 


R7,*AR3,R0 


STF 


R4 , *AR3++( IRO ) 


ADDF3 


R0,R1,R0 


RPTB 


LOOPl_2 


LDF 


*AR4 , R6 


STF 


R0,*ARl++(IR0) 


MPYF 


2.0,R6 


LDF 


*AR2,R7 


LDF 


*ARl,Rl 


MPYF 


2 .0,R7 


SUBF3 


R6,*AR3,R5 


SUBF3 


R5,R1,R4 


SUBF3 


R7 , *AR3,R5 


STF 


R4,*AR4++(IR0) 


ADDF3 


R5,R1, R3 


ADDF3 


R6, *AR3,R4 


STF 


R3,*AR2++(IR0) 


SUBF3 


R4 , Rl , R4 


ADDF3 


R7, *AR3,R0 


STF 


R4,*AR3++(IR0) 


ADDF3 


R0,R1,R0 


STF 


R0,*AR1 



R6 - X(I4) 
R7 * X(I2) 
Rl = X(I1) 
R6 - 2 * X(I4) 
R7 - 2 * X(I2) 
R5 - X(I3) - 2*X(I4) 
R4 * X(I1)-X(I3)+2X(I4)— 
R5 - X(I3) - 2*X(I2) 

X(I4)«« 

R3 * X(I1)+X(I3)-2X(I2)— 
R4 - X(I3) + 2*X(I4) 

X(I2)« 

R4 = X(I1)-X(I3)-2X(I4)- 
RO - X(I3) + 2*X(I2) 

X(I3)« 

RO = X(I1)+X(I3)+2X(I2) 1 



R6 = 
X(I1 
R6 = 
R7 = 
Rl = 
R7 = 
R5 = 
R4 = 
R5 = 
X(I4 
R3 = 
R4 = 
X(I2 
R4 = 
RO = 
X(I3 
RO = 



X(I4) 

M 

2 * X(I4) 

X(I2) 

X(I1) 

2 * X(I2) 

X(I3) - 2*X(I4) 

X(I1)-X(I3)+2X(I4)- 

X(I3) - 2*X(I2) 

)< 

X(I1)+X(I3)-2X(I2)- 

X(I3) + 2*X(I4) 

)* 

X(I1)-X(I3)-2X(I4)- 

X(I3) + 2*X(I2) 
)« 



X(Il)+X(I3)+2X(I2)« 
LAST X(Il)« 
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Check bit reversing mode (on or off). 

B I T_RE VERSING - 0, then OFF (no bit reversing). 
B I T__RE VERS I NG <> 0, then ON. 

LDI @BIT_RE VERSE , R0 

CMPI 0,R0 

BZ MOVE DATA 



Check bit reversing type. 



If SourceAddr * DestAddr, then in place bit reversing. 
If SourceAddr <> DestAddr, then standard bit reversing. 



LDI § SOURCE_ADDR , R0 

CMPI @ DEST__ADDR , R0 

BEQ INJPLACE 



Bit reversing type 1 (from source to destination). 



NOTE: abs ( SOURCE_ADDR - DEST_ADDR) must be > FFTJSIZE, this is not checked. 



II 



LDI 


@FFT_SIZE,R0 


SUBI 


2,R0 


LDI 


@FFTJ3IZE,IR0 


LSH 


-1,IR0 


LDI 


§ SOURCE_ADDR , ARO 


LDI 


SDEST_ADDR,ARl 


LDF 


*AR0++,R1 


RPTS 


R0 


LDF 


*AR0++,R1 


STF 


R1,*AR1++(IR0)B 


STF 


R1,*AR1++(IR0)B 


BR 


DIVISION 



IRO - half FFT size. 
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; In-place bit reversing. 



; Bit reversing on even locations, 1st half 
} only. 



IN PLACE: 



II 



BITRV1 i 



LDI 
LSH 
LDI 

LDI 

LSH 

SUBI 

LDI 

LDI 

LDI 

NOP 

NOP 

LDF 

LDF 

CMP I 

LDFGT 

LDFGT 

RPTB 

LDF 

STF 

LDF 

STF 

CMP I 

LDFGT 

LDFGT 

STF 
STF 



§FFT_SIZE,IRO 

-2,IR0 ; IRO « quarter FFT size. 

2,IR1 

@FFT_SIZE,RC 

-2,RC 

3,RC 

@DEST__ADDR, ARO 

AR0,AR1 

AR0,AR2 



*AR1++(IR0)B 
*AR2++(IR0)B 
*++AR0(IRl) ,R0 
*ARl,Rl 

ARl , ARO ; 
R0,R1 

*AR1++(IR0)B,R1 



Xchange locations only if AR0<ARl , 



BITRV1 

*++AR0(IRl) ,R0 
RO , *AR0 
*ARl,Rl 

Rl,*AR2++(IR0)B 
ARl , ARO 
R0,Rl 
*AR1++(IR0)B,R0 

RO , *AR0 
Rl,*AR2 



Perform bit reversing on odd locations , 
2nd half only. 



LDI @FFT_SIZE,RC 

LSH -1,RC 

LDI @ DEST_ADDR , ARO 

ADDI RC, ARO 

ADD I 1,AR0 

LDI ARO, ARl 

LDI ARO , AR2 

LSH -1,RC 

SUBI 3,RC 

NOP *AR1++(IR0)B 

NOP *AR2++(IR0)B 

LDF *++AR0(IRl) ,R0 
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II 
II 

BITRV2 s 



LDF 
CMP I 
LDFGT 
LDFGT 

RPTB 

LDF 

STF 

LDF 

STF 

CMP I 

LDFGT 

LDFGT 

STF 
STF 



LDI 

LSH 

LDI 

LDI 

LDI 

ADDI 

ADDI 

LSH 

LDI 

SUBI 



*AR1,R1 

ARl , ARO ; 
RO,Rl 

*ARl++(IRO)B,Rl 
BITRV2 

*++ARO(IRl) ,RO 
RO , *ARO 
*AR1 ,R1 

Rl,*AR2++(IR0)B 
ARl, ARO 
RO ,R1 

*ARl++(IRO)B,RO 

RO , *ARO 
Rl , *AR2 



§FFT_SIZE,RC 

-1,RC 

RC,IRO 

§ DEST_ADDR , ARO 
ARO, ARl 

1, ARO 
IRO,ARl 
-1,RC 
RC,IRO 

2, RC 



Xchange locations only if AR0<AR1. 



Perform bit reversing on odd 
locations, 1st half only. 



LDF *AR0,R0 

LDF *AR1,R1 

RPTB BITRV3 

LDF *++AR0(IRl) ,R0 

|| STF R0,*AR1++(IR0)B 

BITRV3: LDF *AR1,R1 

| | STF Rl , *-AR0 ( IR1 ) 

STF RO , *AR1 

STF R1,*AR0 



BR 



DIVISION 
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Check data source locations. 

If SourceAddr « 

DestAddr, then do nothing. 
If SourceAddr <> 

DestAddr, then move data. 



MOVE DATAs 



II 



DIVISION: 



LAST LOOP: 



II 

II 



LDI 


@SOURCE_ADDR,R0 






CMP I 


@DEST_ADDR,RO 






BEQ 


DIVISION 






LDI 


@FFT_SIZE,RO 






SUBI 


2,R0 






LDI 


@ SOURCE_ADDR , ARO 






LDI 


@DEST_ADDR,AR1 






LDF 


*AR0++,Rl 






RPTS 


RO 






LDF 


*AR0++,Rl 






STF 


Rl,*ARl++ 






STF 


Rl , *ARl 






t.ht 

July J. 








T HT 








r JuUiil 


RO 


? 


pyn ac T.OG SIZE 


JrUonr 


KU 


? 


^2 M*?B'S saved 


JrvJJr 


t\\j 






NEGI 


RO 


? 


Neg exponent 


PUSH 


RO 






POPF 


RO 


? 


RU 38 1/rtl DJ.6JC1 


LDI 


@DEST_ADDR, AR1 






LDI 


6 DEST__ADDR , AKz 






NOP 


*AR2++ 






LDI 


@FFT_SIZE,RC 






LSH 


-1,RC 






SUBI 


2,RC 






MPYF3 


RO , *ARl , Rl 


} 


1st location 


RPTB 


LAST_LOOP 






MPYF3 


RO, *AR2,R2 


t 


2nd, 4th, 6th, . . . 


STF 


Rl,*ARl++(IR0) 






MPYF3 


R0,*ARl,Rl 


9 


3rd, 5th, 7th, . . . 


STF 


R2,*AR2++(IR0) 






MPYF3 


R0,*AR2,R2 


1 


Last location 


STF 


Rl , *ARl 






STF 


R2 , *AR2 
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; Return to C environment. 

POP DP ; Restore C environment variables. 

POP AR7 

POP AR6 

POP AR5 

POP AR4 

POPF R7 

POP R7 

POPF R6 

POP R6 

POP R5 

POP R4 

POP FP 

RETS 

• end 

* No more. 

***********************************^ 



The TMS320C3x quickly executes FFT lengths up to 1024 points (complex) 
or 2048 (real) , covering most applications, because it can do so almost entirely 
in on-chip memory. Table 11-1 and Table 11 -2 summarize the number of CPU 
clock cycles and the execution time required for FFT lengths between 64 and 
1 024 points for the four algorithms. 
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Table 11-1. TMS320C3x FFT Timing Benchmarks (Cycles) 



FFT Timing in Cycles 



Number of 
Points 


RADIX-2 
fComolexl 


RADIX-4 
fComDieiri 


RADIX-2 
(Real) 


RADIX-2 
{Real Inverse} 


64 


2770 


2050 


810 


1070 


128 


6170 




1760 


2370 


256 


13600 


10400 


3940 


5290 


512 


29740 




8860 


11740 


1024 


64570 


50670 


19820 


25900 


1024t 


39500 









t This benchmark is based on the Meyer and Schwarz program found in Digital Signal Processing Applications With the TMS320 
Family, Volume 3. 



Table 11-2. TMS320C3x FFT Timing Benchmarks (Milliseconds) 



FFT Timing in Milliseconds 



Number of 
Points 


RADIX-2 
(Complex) 


RADIX-4 
(Complex) 


RADIX-2 
(Real) 


RADIX-2 
(Real Inverse) 


64 


0.139 


0.103 


0.041 


0.054 


128 


0.309 




0.088 


0.119 


256 


0.680 


0.520 


0.197 


0.265 


512 


1.487 




0.443 


0.587 


1024 


3.229 


2.534 


0.991 


1.295 


1024t 


1.975 









t This benchmark is based on the Meyer and Schwarz program found in Digital Signal Processing Applications With the TMS320 
Family, Volume 3. 



11.4.5 Lattice Filters 

The lattice form is an alternative way of implementing digital filters; it has found 
applications in speech processing, spectral estimation, and other areas. In this 
discussion, the notation and terminology from speech processing applications 
are used. 

If H(z) is the transfer function of a digital filter that has only poles, A(z) = 1/H(z) 
will be a filter having only 0s, and it will be called the inverse filter. The inverse 
lattice filter is shown in Figure 11-5. These equations describe the filter in 
mathematical terms: 
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f (i,n)=f (i-1,n) + k(i)b(i-1,n-1) 
b(i,n) = b(i-1,n-1) + k(i)f (i — 1 , n) 

Initial conditions: 

f (0,n) = b(0,n)=x(n) 

Final conditions: 
y (n) = f ( p,n) 

In the above equation, f (i,n) is the forward error, b (i,n) is the backward error, 
k (i) is the i-th reflection coefficient, x (n) is the input, and y (n) is the output 
signal. The order of the filter (that is, the number of stages) is p. In the linear 
predictive coding (LPC) method of speech processing, the inverse lattice filter 
is used during analysis, and the (forward) lattice filter during speech synthesis. 



Figure 11-5. Structure of the Inverse Lattice Filter 



x(n)^ f(0, n) 



b(0, n) 



b(1,n) 



b(M,n) 



f(P, = y( n) 




Figure 11-6 shows the data memory organization of the inverse lattice-filter 
on the TMS320C3X. 

Figure 11-6. Data Memory Organization for Lattice Filters 



Low 
Address 



High 
Address 



Reflection 
Coefficients 


Backward 
Propagation Terms 


k(1) 




b(0,n-1) 


k(2) 




b(1,n-1) 


• 


• 


• 


• 


• 


• 


I k(p) | 


b(p-1,n-1) | 



Example 11-40 shows the implementation of an inverse lattice filter. 
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Example 11-40. Inverse Lattice Filter 



TITLE INVERSE LATTICE FILTER 



* 

* 

* SUBROUTINE LATINV 

* LATINV 
* 



LATTICE FILTER (LPC INVERSE FILTER - ANALYSIS) 



* TYPICAL CALLING SEQUENCE: 
* 



* 


load 


R2 


* 


load 


ARO 


* 


load 


AR1 


* 


load 


RC 




CALL 


LATINV 


* 






* 







ARGUMENT ASSIGNMENTS: 



ARGUMENT | FUNCTION 



R2 

ARO 

AR1 

RC 



f (0,n) = x(n) 

ADDRESS OF FILTER COEFFICIENTS (k(l)) 
ADDRESS OF BACKWARD PROPAGATION 

VALUES (b(0,n-l)) 
RC = p - 2 



REGISTERS USED AS INPUT: R2, ARO , ARl , RC 

REGISTERS MODIFIED: RO, Rl, R2, R3, RS, RE, RC, ARO, ARl 

REGISTER CONTAINING RESULT: R2 (f(p,n)) 



PROGRAM SIZE: 10 WORDS 



EXECUTION CYCLES: 13 + 3 * (p-1) 



.global LATINV 

* 

* i = 1 

LATINV MPYF3 *AR0, *ARl, RO 
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LDF R2,R3 

MPYF3 *ARO++ ( 1 ) , R2 , Rl 



2 <= i <= p 

RPTB LOOP 

MPYF3 *AR0 , *++ARl ( 1 ) , RO 
ADDF3 R2,R0,R2 



| | STFR3, *-ARl(l) 

LOOP MPYF3 *AR0++(1) ,R2,R1 
* 



P+l (CLEANUP) 



k(l) * b(0,n-l) -> RO 

Assume f(0,n) -> R2. 

Put b(0,n) - f(0,n) -> R3. 

k(l) * f(0,n) -> Rl 



k(i) * b(i-l,n-l) -> RO 
f (i-l-l,n)+k(i-l) 
*b(i-l-l,n-l) 
= f(i-l,n) -> R2 

b(i-l-l,b-l)+k(i-l)*£ (i-l-l,n) 

s n\ P** 

b(i-l-l,n) -> b(i-l-l,n-l) 



; k(i) * f(i-l,n) -> Rl 



ADDF3 R2,R0,R2 



ADDF3 *ARl r Rl, R3 
STF R3, *AR1 



f (p-l,n)+k(p)*b(p-l,n-l) 
= f (p,n) -> R2 

b(p-l,n-l)+k(p)*f (p-l,n) 
» b(p,n) -> R3 
b(p-l,n) -> b(p-l,n-l) 



RETURN SEQUENCE 

RETS ; RETURN 

* end 
• end 

The forward lattice filter is similar in structure to the inverse filter, as shown in 
Figure 11-7. 



Figure 11-7. Structure of the (Forward) Lattice Filter 
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These corresponding equations describe the lattice filter: 

f (i-1,n)=f (j, n )-k(i)b(i-1,n-1) 
b(i,n) = b(i-1,n-1) + k(i)f (i-1,n) 

Initial conditions: 

f (p,n) = x (n), b (i,n - 1) = 0 for i = 1 ,...,p 

Final conditions: 
y(n)=f(0,n) 

The data memory organization is identical to that of the inverse filter, as shown 
in Figure 11-6 on page 11-126. Example 11-41 shows the implementation of 
the lattice filter on the TMS320C3x. 



Example 11-41. Lattice Filter 



TITLE LATTICE FILTER 



SUBROUTINE LATICE 



* 


LOAD 


ARO 


* 


LOAD 


AR1 


* 


LOAD 


RC 


* 

* 


CALL 


LATICE 


* 


ARGUMENT 


ASSIGNMENTS: 


* 
* 


ARGUMENT 


| FUNCTION 
-+ 


* 


R2 


| F(P,N) - E(N) ■ EXCITATION 


* 


ARO 


j ADDRESS OF FILTER COEFFICIENTS (K(P) ) 


* 


AR1 


j ADDRESS OF BACKWARD PROPAGATION VALUES 




IRO 


I 3 


* 


RC 


j RC - P - 3 


* 


REGISTERS 


USED AS INPUT: R2 , ARO, ARl , RC 



REGISTERS MODIFIED: RO, Rl, R2 , R3, RS, RE, RC, ARO, ARl 
REGISTER CONTAINING RESULT: R2 (f(0,n)) 

STACK USAGE: NONE 

PROGRAM SIZE: 12 WORDS 

EXECUTION CYCLES: 15 + 3 * (P-2) 
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Application-Oriented Operations 



.global LATICE 

* 

LATICE MPYF3 *AR0 , *ARl , RO 

* } K(P) * B(P-1,N-1) -> RO 

; Assume F(P,N) -> R2 

SUBF3 R0,R2,R2 ; F(P, N)-K(P) *B(P-1,N-1 ) 

. = F(P-1,N) -> R2 

|| MPYF3 * — AR0(1) ,* — AR1(1) ,R0 

? K(P-l) * B(P-2,N-1) -> RO 

SUBF3 R0,R2,R2 ; F(P-1,N)-K(P-1)*B(P-2,N-1) 

} * F(P-2,N) -> R2 

|| MPYF3 *— AR0(1) , * — AR1(1) ,R0 

; K(P-2) * B(P-3,N-1) -> RO 

MPYF3 R2, *+AR0(l) ,R1 ; F(P-2,N) * K(P-l) -> Rl 

ADDF3 R1,*+AR1(1) ,R3 ; F(P-2,N) * K(P-l) + B(P-2,N-1) 

0 = TUP_1 -> R3 

1 — \ - — i i — 

; 1 <= I <= P-2 

RPTB LOOP 

SUBF3 R0,R2,R2 ; F(I,N) - K(I) * B(I-1,N-1) 

? - F(I-1,N) -> R2 

|| MPYF3 * — AR0(1) , * — AR1(1),R0 

; K(I-l) * B(I-2,N-1) -> RO 

STF R3 , *+ARl ( IRO ) ; B(I+1,N) -> B(I+1,N-1) 

|| MPYF3 R2,*+AR0(1) ,Rl ; F(I-1,N) * K(I) -> Rl 

LOOP ADDF3 Rl , *+ARl ( 1 ) , R3 ; F(I-1,N) * K(I) + B(I-1,N-1) 

? * B(I,N) -> R3 

STF R3,*+AR1(2) ; B(1,N) -> B(1,N-1) 

STF R2,*+AR1(1) ; F(0,N) -> B(0,N-1) 

* RETURN SEQUENCE 

RETS 

* 

* END 

• end 
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11.5 Programming Tips 

Programming style reflects personal preference. The purpose of this section 
is not to impose any particular style; rather, it is to highlight features of the 
TMS320C3x that can help to produce faster and/or shorter programs. The tips 
cover the C compiler, assembly language programming, and low-power-mode 
wakeup. 

11.5.1 C-Callable Routines 

The TMS320C3x was designed with a large register file, software stack, and 
large memory space to implement a high-level language (HLL) compiler easi- 
ly. The first such implementation supplied is a C compiler. Use of the C compil- 
er increases the transportability of applications that have been tested on large, 
general-purpose computers, and it decreases their porting time. 

For best use of the compiler, complete the following steps: 

1) Write the application in the high-level language. 

2) Debug the program. 

3) Determine whether it runs in real-time. 

4) If it doesn't, identify the places where most of the execution time is spent. 

5) Optimize these areas by writing assembly language routines that implement 
the functions. 

6) Call the routines from the C program as C functions. 

When writing a C program, you can increase the execution speed by maximiz- 
ing the use of register variables. For more information, refer to the 
TMS320C3x C Compiler Reference Guide. 

You must observe certain conventions when writing a C-callable routine. 
These conventions are outlined in the Runtime Environment chapter of the 
TMS320C3x C Compiler Reference Guide. Certain registers are saved by the 
calling function, and others need to be saved by the called function. The C 
compiler manual helps achieve a clean interface. The end result is the read- 
ability and natural flow of a high-level language combined with the efficiency 
and special-feature use of assembly language. 

11.5.2 Hints for Assembly Coding 

Each program has particular requirements. Not all possible optimizations will 
make sense in every case. You can use the suggestions presented in this sec- 
tion as a checklist of available software tools. 
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□ Use delayed branches. Delayed branches execute in a single cycle; reg- 
ular branches execute in four cycles. The following three instructions are 
also executed whether the branch is taken or not. If fewer than three in- 
structions can be used, use the delayed branch and append NOPs. Ma- 
chine cycles (time) are still being saved. 

□ Apply the repeat single/block construct. In this way, loops are achieved 
with no overhead. Nesting such constructs will not normally increase effi- 
ciency, so try to use the feature on the most often performed loop. Note 
that RPTS is not interruptible, and the executed instruction is not refetched 
for execution. This frees the buses for operands. 

□ Use parallel instructions. It is possible to have a multiply in parallel with 
an add (or subtract) and to have stores in parallel with any multiply or ALU 
operation. This increases the number of operations executed in a single 
cycle. For maximum efficiency, observe the addressing modes used in 
parallel instructions and arrange the data appropriately. 

□ Maximize the use of registers. The registers are an efficient way to ac- 
cess scratch-pad memory. Extensive use of the register file facilitates the 
use of parallel instructions and helps avoid pipeline conflicts when you use 
the registers in addressing modes. 

□ Use the cache. This is especially important in conjunction with external 
slow memory. The cache is transparent to the user, so make sure that it 
is enabled. 

□ Use internal memory instead of external memory. The internal 
memory (2K x 32 bits RAM and 4K x 32 bits ROM) is considerably faster 
to access. In a single cycle, two operands can be brought from internal 
memory. You can maximize performance if you use the DMA in parallel 
with the CPU to transfer data to internal memory before you operate on it. 

□ Avoid pipeline conflicts. If there is no problem with program speed, 
ignore this suggestion. For time-critical operations, make sure you do not 
miss any cycles because of conflicts. To identify conflicts, run the trace 
function on the development tools (simulator, emulators) with the program 
tracing option enabled. The tracing immediately identifies the pipeline 
conflicts. Consult the appropriate section of this user's guide for an expla- 
nation of the reason for the conflict. You can then take steps to correct the 
problem. 

The above checklist is not exhaustive, and it does not address the more de- 
tailed features outlined in other sections of this manual. To learn how to exploit 
the full power of the TMS320C3x, study the architecture, hardware configura- 
tion, and instruction set of the device. These subjects are described in earlier 
chapters. 
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11.5.3 Low-Power-Mode Wakeup Example 

There are two instructions by which the TMS320C31 is placed in the low power 
consumption mode: 

□ IDLE2 

□ LOPOWER 

The LOPOWER instruction will slow down the H1/H3 clock by a factor of 16 
during the read phase of the instruction. The MAXSPEED instruction will wake 
the device from the low-power mode and return it to full frequency during 
MAXSPEED's read cycle. However, the H1/H3 clock may resume with the 
phase opposite from before the clocks were shut down. 

The IDLE2 instruction has the same functions that the IDLE instruction has, 
except that the clock is stopped during the execute phase of the IDLE2 instruc- 
tion. The clock pin will stop with H1 high and H3 low. The status of all of the 
signals will remain the same as in the execute phase of the IDLE2 instruction. 
In emulation mode, however, the clocks will continue to run, and IDLE2 will op- 
erate identically to IDLE. The external interrupts INT(0-3) are the only signals 
\ that start the processor up from the mode the device was in. Therefore, you 
must enable the external interrupt before going to IDLE2 power-down mode. 
(See Example 11-42.) If the proper external interrupt is not set up before 
executing I DLE2 to power down, the only way to wake up the processor is with 
a device RESET. 



Example 11-42. Setup oflDLE2 Power-Down-Mode Wakeup 



* TITLE IDLE 2 POWER-DOWN MODE WAKEUP ROUTINE SETUP 
* 

* THIS EXAMPLE SETS UP THE EXTERNAL INTERRUPT 0, INTO, BEFORE 

* EXECUTING THE IDLE 2 INSTRUCTION. WHEN THE INTO SIGNAL IS RECEIVED 

* LATER, THE PROCESSOR WILL RESUME FROM ITS PREVIOUS 

* STATE. NOTE: THE "INTRPT" SECTION IS MAPPED FROM THE 

* ADDRESS 0 FROM THE RESET AND INTERRUPT VECTORS. 



sect "INTRPT" 

RESET .word START 

INTO .word INTOJESR 

INT1 .word INT1JCSR 

INT2 .word INT2_ISR 

INT3 .word INT3 ISR 



.text 



Reset vector 
INTO interrupt vector 
INTl interrupt vector 
INT2 interrupt vector 
INT3 interrupt vector 
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LDP 
LDI 
OR 



@SP_ADR 
§SP_ADR,SP 
Olh, IE 



Set up stack pointer 
Enable INTO 

Set GIE ■ 1 and stop clock 



IDLE 2 



INTO ISR 



RETI 



Return to instruction after IDLE 2 



There will be one cycle of delay while waking up the processor from the IDLE2 
power-down mode before the clocks start up. This adds one extra cycle from 
the time the interrupt pad goes low until the interrupt is taken. The interrupt pad 
needs to be low for at least two cycles. The clocks may start up in the phase 
opposite from before the clocks were stopped. 
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The TMS320C3x's advanced interface design can implement many system 
configurations. Its two external buses and DMA capability provide a parallel 
32-bit interface to external devices, while the interrupt interface, dual serial 
ports, and general-purpose digital I/O provide communication with many 
peripherals. 

This chapter describes how to use the TMS320C3x's interfaces to connect to 
various external devices. Specific discussions include implementation of par- 
allel interface to devices with and without wait states, use of general-purpose 
I/O, and system control functions. All interfaces shown in this chapter have 
been built and tested to verify proper operation and apply to the TMS320C30. 
Comparable designs for the other TMS320C3x devices can be implemented 
with appropriate logic. 

Major topics discussed in this chapter are as follows: 



Topic Page 

12.1 System Configuration Options Overview .•..".«.. 12-2 

12.2 Primary Bus Interface ........................................ 12-4 

12.3 Expansion Bus Interface . 12-19 

12.4 System Control Functions , , , „ 12-27 

12.5 Serial-Port Interface ................. 12-32 

12.6 Low-Power-Mode Interrupt Interface ,.„.,,....,.,,..,,,.•,,..., 12-36 

12.7 XDS Target Design Considerations 12-39 
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12.1 System Configuration Options Overview 

The various TMS320C3x interfaces connect to many different device types. 
Each of these interfaces is tailored to a particular family of devices. 

12.1.1 Categories of Interfaces on the TMS320C3x 

The TMS320C3x interface types fall into several categories, depending on the 
devices to which they are intended to be connected. Each interface comprises 
one or more signal lines that transfer information and control its operation. 
Figure 12-1 shows the signal line groupings for each of these various inter- 
faces. 

Figure 12-1. External Interfaces on the TMS320C3x 
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All of the interfaces are independent of one another, and you can perform dif- 
ferent operations simultaneously on each interface. 

The primary and expansion buses implement the memory-mapped interface 
to the device. The external direct memory access (DMA) interface allows ex- 
ternal devices to cause the processor to relinquish the primary bus and allow 
direct memory access. 
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12.1.2 Typical System Block Diagram 



The devices that can be interfaced to the TMS320C3x include memory, DMA 
devices, and numerous parallel and serial peripherals and I/O devices. 
Figure 12-2 illustrates a typical configuration of a TMS320C3x system with 
different types of external devices and the interfaces to which they are con- 
nected. 



Figure 12-2. Possible System Configurations 
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This block diagram constitutes essentially a fully expanded system. In an actual 
design, you can use any subset of the illustrated configuration as appropriate. 
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12.2 Primary Bus Interface 



The TMS320C3x uses the primary bus to access the majority of its 
memory-mapped locations. Therefore, typically, when a large amount of exter- 
nal memory is required in a system, it is interfaced to the primary bus. The ex- 
pansion bus (discussed in Section 1 2.3 on page 12-19) a ctua lly compr ises two 
mutually exclusive interfaces, controlled by the MSTRB an d IOSTR B signals, 
respectively. Cycles on the expansion bus controlled by the MSTRB signal are 
essentially equivalent to cycles on the primary bus, except that bank switching 
is not implemented on the expansion bus. Acco rdingly, t he discussion of pri- 
mary bus cycles in this section applies equally to MSTRB cycles on the expan- 
sion bus. 

Although you can use both the primary bus and the expansion bus to interface 
to a wide variety of devices, the devices most commonly interfaced to these 
buses are memories. Therefore, this section presents detailed examples of 
memory interface. 



Zero-wait-state read access time for the TMS320C3x is determined by the dif- 
ference between the cycle time (specification 10 in Table 13-12 on page 
13-31) and the sum ofthe times forH1 low to address valid (specification 14.1 
in Table 13-13 on page 13-34) and data setup before next H1 low (specifica- 
tion 15.1 in Table 13-13 on page 13-34): 



For example, for full-speed, zero-wait-state interface to any device, the 60-ns 
TMS320C3x requires a read access time of 30 ns from address stable to data 
valid. Because for most memories access time from chip select is the same 
as access time from address, it is theoretically possible to use 30-ns memories 
at full speed with the TMS320C3x-33. This requires that there be no delays 
between the processor and the memories. However, because of 
interconnection delays and because some gating is normally required for chip- 
select generation, this is usually not the case. Therefore, slightly faster memo- 
ries are required in most systems. 

Among currently available RAMs, there are two distinct categories of devices 
with different interface characteristics: 

□ RAMs without output enable control lines (OE), which include the one-bit- 
wide organized RAMs and most of the four-bit wide RAMs 

□ RAMs with OE controls, which include the byte-wide RAMs and a few of 
the four-bit wide RAMs 



12.2.1 Zero-Wait-State Interface to Static RAMs 
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Many of the fastest RAMs do not provide (5E control; they use chip-select (CS) 
controlled write cycles to ensure that data outputs do not turn on for write oper- 
ations. In CS-controlled write cycles, the write control line (WE) goes low be- 
fore CS goes low, and internal logic holds the outputs disabled until the cycle 
is completed. Using CS-controlled write cycles is an efficient way to interface 
fast RAMs without OE controls to the TMS320C30 at full speed. 

In the case of RAMs with OE controls, using this signal can add flexibility to 
many systems. Additionally, many of these devices can be interfaced by using 
CS-controlled write cycles with OE tied low in the same manner as with RAMs 
without OE controls. There are, however, two requirements for interfacing to 
OE RAMs hrthis manner. First, the RAM's OE input must be gated with chip 
select and WE internally so that the device's outputs do not turn on unless a 
read is being performed. Second, the RAM must allow its address inputs to 
change while WE is low; some RAMs specifically prohibit this. 

Figure 12-3 shows the TMS320C3x interfaced to Cypress Semiconductor's 
CY7C1 86 25-ns 8K x 8-bit CMOS static RAM with the OE control input tied low 
and using a CS-controlled write cycle. 
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Figure 12-3. TMS320C3x Interface to Cypress Semiconductor CY7C186 CMOS SRAM 
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In this circuit, the two chip selects on the RAM are driven by STRB and A23, 
which are ANDed together internally. A23 locat es the RAM at addresses 
OOOOOh through 03FFFh in external memory, and STRB establishes the CS- 
controlled write cycle. The WE control input is then driven by the TMS320C3x 
R/W signal, and the OE input is not used and id therefore connected to ground. 



The timing of read operations, shown in Figure 12-4, is very straightforward 
because the two chip-select inputs are driven directly. The read access time 
of the circuit is therefore the inverter propagation delay added to the RAM's 
chip-select access time, ort«| + t2 = 5 + 25 = 30 ns. This access time therefore 
meets the TMS320C3x-33's specified 30-ns read access time requirement. 
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Figure 12-4. Read Operations Timing 
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During write operations, as shown in Figure 12-5, the RAM's outputs do not 
turn on at all, because of the use of the chip-select controlled write cycles. The 
chip-select controlled write cycles are generated because R/W goes active 
(low) before the STRB term of the chip-select input. Because the RAM's output 
drivers are disabled whenever the WE input is low (regardless of the state of 
the OE input), bus conflicts with the TMS320C3x are automatically avoided 
with this interface. The circuit's data setup and hold times (t-| and t2 in the timing 
diagram) of approximately 50 and 20 ns, respectively, also easily meet the 
RAM's timing requirements of 10 and 0 ns. 
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Figure 12-5. Write Operations Timing 
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If you require more complex chip-select decode than can be accomplished in 
time to meet zero-wait-state timing, you should use wait states (see subsec- 
tion 12.2.2) or bank-switching techniques (see subsection 12.2.3). 

Note that the CY7C1 86's OE control is gated internally with CS; therefore, the 
RAM's outputs are not enabled unless the device is selected. This is critical 
if there are any other devices connected to the same bus; if there are no other 
devices connected to the bus, OE need not be gated internally with chip select. 

You can easily interface RAMs without OE controls to the TMS320C3x by us- 
ing an approach similar to that used with RAMs with OE controls. If only one 
bank of memory is implemented and no other devices are present on the bus, 
the memories' CS input can usually be connected to STRB directly. If several 
devices must be selected, however, a gate is generally required to AND the 
device select and STRB to drive the CS input to generate the chip-select con- 
trolled write cycles. In either case, the WE input is driven by the TMS320C3x 
R/W signal. Provided sufficiently fast gating is used, 25-ns RAMs can still be 
used. 

As with the case of RAMs with OE control lines, this approach works well if only 
a few banks of memory are implemented where the chip-select decode can 
be accomplished with only one level of gating. If many banks are required to 
implement very large memory spaces, bank switching can be used to provide 
for multiple bank select generation while still maintaining full-speed accesses 
within each bank. Bank switching is discussed in detail in subsection 12.2.3. 
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12.2.2 Ready Generation 

The use of wait states can greatly increase system flexibility and reduce hard- 
ware requirements over systems without wait-state capability. The 
TMS320C3x has the capability of generating wait states on either the primary 
bus or the expansion bus; both buses have independent sets of ready control 
logic.This subsection discusses ready generation from the perspective of the 
primary bus interface; however, wait-state operation on the expansion bus is 
similar to that on the primary bus. Therefore, these discussions also pertain 
to expansion bus operation. Accordingly, ready generation is not included in 
the specific discussions of the expansion bus interface. 

Wait states are generated on the basis of: 

□ the internal wait-state generator, 

□ the external ready input (RDY), or 

□ the logical AND or OR of the two. 

When enabled, internally generated wait states affect all external cycles, re- 
gardless of the address accessed. If different numbers of wait states are re- 
quired for various external devices, the external RDY input may be used to tai- 
lor wait-state generation to specific system requirements. 

If the logical AND (electrical OR) of the wait count and external ready signals 
is selected, the later of the two signals will control the internal ready signal, and 
both signals must occur. Accordingly, external ready control must be imple- 
mented for each wait-state device, and the wait count ready signal must be en- 
abled. 

If the logical OR (or electrical AND, since the signals are low true) of the exter- 
nal and internal wait-count ready signals is selected, the earlier of the two sig- 
nals will generate a ready condition and allow the cycle to be completed. Both 
signals need not be present. 

ORing of the Ready Signals 

The OR of the two ready signals can implement wait states for devices that 
require a greater number of wait states than are implemented with external 
logic (up to seven), This feature is useful, for example, if a system contains 
some fast and some slow devices. In this case, fast devices can generate a 
ready signal externally with a minimum of logic, and slow devices can use the 
internal wait counter for larger numbers of wait states. Thus, when fast devices 
are accessed, the external hardware responds promptly with a ready signal 
that terminates the cycle. When slow devices are accessed, the external hard- 
ware does not respond, and the cycle is appropriately terminated after the in- 
ternal wait count. 
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You can use the OR of the two ready signals if conditions occur that require 
termination of bus cycles prior to the number of wait states implemented with 
external logic. In this case, a shorter wait count is specified internally than the 
number of wait states implemented with the external ready logic, and the bus 
cycle is terminated after the wait count. This feature can also be a safeguard 
against inadvertent accesses to nonexistent memory that would never re- 
spond with ready and would therefore lock up the TMS320C3x. 

If the OR of the two ready signals is used, however, and the internal wait-state 
count is less than the number of wait states implemented externally, the exter- 
nal ready generation logic must have the ability to reset its sequencing to allow 
a new cycle to begin immediately following the end of the internal wait count. 
This requires that, under these conditions, consecutive cycles be from inde- 
pendently decoded areas of memory and that the external ready generation 
logic be capable of restarting its sequence as soon as a new cycle begins. 
Otherwise, the external ready generation logic might iose synchronization with 
bus cycles and therefore generate improperly timed wait states. 

ANDIng of the Ready Signals 

The AND of the two ready signals can be used to implement wait states for de- 
vices that are equipped to provide a ready signal but cannot respond quickly 
enough to meet the TMS320C3x's timing requirements. In particular, if these 
devices normally indicate a ready condition and, when accessed, respond with 
a wait until they become ready, the logical AND of the two ready signals can 
be used to save hardware in the system, In this case, the internal wait counter 
can provide wait states initially and become ready after the external device has 
had time to send a not ready indication. The internal wait counter then remains 
ready until the external device also becomes ready, which terminates the 
cycle. 

Additionally, the AND of the two ready signals can extend the number of wait 
states for devices that already have external ready logic implemented but re- 
quire additional wait states under certain unique circumstances. 

External Ready Generation 

In the implementation of external ready generation hardware, the particular 
technique employed depends heavily on the specific characteristics of the sys- 
tem. The optimum approach to ready generation varies, depending on the rel- 
ative number of wait-state and non-wait-state devices in the system and on the 
maximum number of wait states required for any one device. The approaches 
discussed here are intended to be general enough for most applications and 
are easily modifiable to comprehend many different system configurations. 
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In general, ready generation involves the following three functions: 

□ Segmentating the address space in some fashion to distinguish fast and 
slow devices 

□ Generating properly timed ready indications 

□ Logically ORing all of the separate ready timing signals together to con- 
nect to the physical ready input 

Segmentation of the address space is required to obtain a unique indication 
of each particular area within the address space that requires wait states. This 
segmentation is commonly implemented in a system in the form of chip-select 
generation. In many cases, you can use chip-select signals to initiate wait 
states; however chip-select decoding considerations might occasionally pro- 
vide signals that will not allow ready input timing requirements to be met. In this 
case, you could make coarse address space segmentation on the basis of a 
small number of address lines, where simpler gating allows signals to be gen- 
erated more quickly. In either case, the signal indicating that a particular area 
of memory is being addressed is normally used to Initiate a ready or wait-state 
indication. 

Once the region of address space being accessed has been established, a 
timing circuit of some sort is normally used to provide a ready indication to the 
processor at the appropriate point in the cycle to satisfy each device's unique 
requirements. 

Finally, since indications of ready status from multiple devices are typ ically 
present, the signals are logically ORed by using a single gate to drive the RDY 

input. 

Ready Control Logic 

You can take one of two basic approaches in the implementation of ready con- 
trol logic, depending on the state of the ready input between accesses. If RDY 
is low between accesses, the processor is always ready unless a wait state is 
required; if RDY is high between accesses, the processor will always enter a 
wait state unless a ready indication is generated. 

If RDY is low between accesses, control of full-speed devices is straightfor- 
ward; no action is necessary because ready is always active unless otherwise 
required. Devices requiring wait states, however, must drive ready high fast 
enough to meet the input timing requirements. Then, after an appropriate 
delay, a ready indication must be generated. This can be quite difficult in many 
circumstances because wait-state devices are inherently slow and often re- 
quire complex select decoding. 
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If RDY is high between accesses, zero-wait-state devices, which tend to be 
inherently fast, can usually respond immediately with a ready indication. Wait- 
state devices might delay their select signals appropriately to generate a 
ready. Typically, this approach results in the most efficient implementation of 
ready control logic. Figure 1 2-6 shows a circuit of this type, which can be used 
to generate zero, one, or two wait states for multiple devices in a system. 



Figure 12-6. Circuit for Generation of Zero, One, or Two Wait States for Multiple Devices 
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Example Circuit 

In this circuit, full-speed devices drive ready directly through the 74AS21 , and 
the two flip-flops delay wait-state devices' select signals one or two H1 cycles 
to provide one or two wait states. 

Considering the TMS320C3x-33's ready delay time of eight ns following ad- 
dress, zero-wait-state devices must use ungated address lines directly to drive 
the input of the 74AS 21 , sin ce this gate contributes a maximum propagation 
delay of six ns to the RDY signal. Thus, zero-wait-state devices should be 
grouped together within a coarse segmentation of address space if other de- 
vices in the system require wait states. 

With this circuit, devices requiring wait states might take up to 36 ns from a val- 
id address on the TMS320C3x to provide inputs to the 74AS20's inputs. This 
usually allows sufficient time for any decoding required in generating select 
signals for slow er devic es in the system. For example, the 74ALS138, driven 
by address and STRB, can generate select decodes in 22 ns, which easily 
meets the TMS320C3x-33's timing requirements. 

With this circuit, unused inputs to either the 74AS20S or the 74AS21 should 
be tied to a logic high level to prevent noise from generating spurious wait 
states. 

If more than two wait states are required by devices within a system, other ap- 
proaches can be employed for ready generation. If between three and seven 
wait states are required, additional flip-flops can be included in the same man- 
ner shown in Figure 12-6, or internally generated wait states can be used in 
conjunction with external hardware. If more than seven wait states are re- 
quired, an external circuit using a counter may be used to supplement the ca- 
pabilities of the internal wait-state generators. 



12.2.3 Bank Switching Techniques 

The TMS320C3x's programmable bank switching feature can greatly ease 
system design when large amounts of memory are required. Because, in gen- 
eral, devices take longer to release the bus than they take to drive the bus, 
bank switching is used to provide a period of time for disabling all device se- 
lects that would not be present otherwise (refer to Section 7.4 on page 7-30 
for further information regarding bank switching). During this interval, slow de- 
vices are allowed time to turn off before other devices have the opportunity to 
drive the data bus, thus avoiding bus contention. 
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When bank switching is enabled, any time a portion of the high order a ddress 
lines changes, as defined by the content s of th e BNKCMPR register, STRB 
goes high for one full H1 cycle. Provided STRB is included in chip-select de- 
codes, this causes all devices t o be di sabled during this period. The next bank 
of devices is not enabled until STRB goes low again. 

In general, bank switching is not required during writes, because these cycled 
alw ays exh ibit an inherent one-half H1 cycle setup of address information be- 
fore STRB goes low. Thus, when you use bank switching for read/write de- 
vices, a minimum of half of one H1 cycle of address setup is provided for all 
accesses. Therefore, large amounts of memory can be implemented without 
wait states or extra hardware required for isolation between banks. Also, note 
that access time for cycles during bank switching is the same as that for cycles 
without bank switching, and, accordingly, full-speed accesses can still be ac- 
complished within each bank. 

When you use bank switching to implement large multiple-bank memory sys- 
tems, an important consideration is address line fanout. Besides parametric 
specifications for which account must be made, AC characteristics are also 
crucial in memory system design. With large memory arrays, which commonly 
require large numbers of address line inputs to be driven in parallel, capacitive 
loading of address outputs is often quite large. Because all TMS320C3x timing 
specifications are guaranteed up to a capacitive load of 80 pF, driving greater 
loads will invalidate guaranteed AC characteristics. Therefore, it is often nec- 
essary to provide buffering for address lines when driving large memory ar- 
rays. AC timings for buffer performance can then be derated according to man- 
ufacturer specifications to accommodate a wide variety of memory array sizes. 

The circuit shown in Figure 1 2-7 illustrates the use of bank switching with Cy- 
press Semiconductor's CY7C1 85 25-ns 8K x 8 CMOS static RAM. This circuit 
implements 32K 32-bit words of memory with one-wait-state accesses within 
each bank. 

A wait state is required with this implementation of bank memory because of 
the added propagation delay presented by the address bus buffers used in the 
circuit. The wait state is not a function of the memory organization of multiple 
banks or the use of bank switching. When bank switching is used, memory ac- 
cess speeds are the same as without bank switching, once bank boundaries 
are crossed. Therefore, no speed penalty is paid when bank switching is used, 
except for the occasional extra cycle inserted when bank boundaries are 
crossed. Note, however, that if the extra cycle inserted when bank boundaries 
are crossed does impact software performance significantly, you can often re- 
structure code to minimize bank boundary crossings, thereby reducing the ef- 
fect of these boundary crossings on software performance. 
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The wait state for this bank memory is generated by using the wait-state gener- 
ator circuit presented in the previous section. Because A23 is the signal that 
enables the entire bank memory system, the inverted version of this signal is 
ANDed with STRBto derive a one-wait-state device select. This signal is then 
connected in the circuit along with the other one-wait-state device selects. 
Thus, any time a bank memory access is made, one wait state is generated. 

Each of the four banks in this circuit is selected by using a decode of A1 5-A1 3 
generated by the 74AS138 (see Figure 12-8). With the BNKCMPR register 
set to OBh, the banks will be selected on even 8K-word boundaries starting at 
location 080A000h in external memory space. 

Figure 12-7. Bank Switching for Cypress Semiconductor's CY7C185 
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Figure 12-8. Bank Memory Control Logic 
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The 74ALS2541 buffers used on the address lines are necessary in this design 
because the total capacitive load presented to each address line is a maximum 
of 1 6 x 1 0 pF or 1 60 pF (bank memory plus zero-wait-state static RAM), which 
exceeds the TMS320C3x rated capacitive loading of 80 pF. Using the 
manufacturer's derating curves for these devices at a load of 80 pF (the load 
presented by the bank memory) predicts propagation delays at the output of 
the buffers of a maximum of 16 ns. The access time of a read cycle within a 
bank of the memory is therefore the sum of the memory access time and the 
maximum buffer propagation delay, or 25 + 1 6 = 41 ns, which, since it falls be- 
tween 30 and 90 ns, requires one wait state on the TMS320C3x-33. 

The 74ALS2541 buffers offer one additional system-performance enhance- 
ment in that they include 25-ohm resistors in series with each individual buffer 
output. These resistors greatly improve the transient response characteristics 
of the buffers, especially when driving CMOS loads such as the memories 
used here. The effect of these resistors is to reduce overshoot and ringing, 
which is common when driving predominantly capacitive loads such as 
CMOS. The result is reduced noise and increased immunity to latch-up in the 
circuit, which in turn results in a more reliable memory system. Having these 
resistors included in the buffers eliminates the need to put discrete resistors 
in the system, which is often required in high-speed memory systems. 

This circuit cannot be implemented without bank switching because data out- 
put's turn-on and turn-off delays cause bus conflicts. Here, the propagation 
delay of the 74AS1 38 is involved only during bank switches, when there is suf- 
ficient time between cycles to allow new chip selects to be decoded. 

The timing of this circuit for read operations using bank switching is shown in 
Figure 1 2-9. With the BNKCMPR register set to OBh, when a bank switch oc- 
curs, the bank address on address lines A23-A1 3 is updated during the extra 
H1 cycle while STRB is high. Then, after chip-select decodes have stabilized 
and the previously selected bank has disabled its outputs, STRB goes low for 
the next read cycle. Further accesses occur at normal bus timings with one 
wait state, as long as another bank switch is not necessary. Write cycles do 
not require bank switching due to the inherent address setup provided in their 
timings. 
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Figure 12-9. Timing for Read Operations Using Bank Switching 
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This timing is summarized in Table 12-1 . 
Table 12-1. Bank Switching Interface Timing 



Timer Interval 


Event 


Time Period 


t1 


H1 falling to address valid/STRB rising 


14 ns 


t2 


Address valid to select delay 


10 ns 


t3 


Memory disable from STRB 


10 ns 


t4 


H1 falling to STRB 


10 ns 


t5 


STRB to select delay 


4.5 ns 


t6 


Memory output enable delay 


3 ns 



t Timing for the TMS320C3X-33 



12-18 



Expansion Bus Interface 



12.3 Expansion Bus Interface 

The TMS320C30's expansion bus interface provides a second complete par- 
allel bus, which can be used to implement data transfers concurrently with (and 
independently of) operations on the primary bus. The expansion bus com- 
prises two mutually exclusive interfaces controlled by the MSTRB and 
IOSTRB signals, respectively. This subsecti on discusses interface to the ex- 
pansion bus using IOSTRB cycles; MSTRB cycles are essentially equivalent 
in timing to primary bus cycles and are discussed in Section 12.2, beginning 
on page 12-4. This section applies to TMS320C30 devices. 

Unlike the primary bus, both read and write cycles on the I/O portion of the ex- 
pansion bus are two H1 cycles in duration and exhibit the same timing. The 
XR/W signal is high for reads and low for writes. Since I/O accesses take two 
cycles, many periph erals that require wait states if interfaced either to the pri- 
mary bus or by using MSTRB can be used in a system without the need for wait 
states. Specifically, in cases where there is only one device on the expansion 
bus, devices with address access times greater than the 30 ns required by the 
primary bus, but less than 59 ns, can be interfaced to the I/O bus of the 
TMS320C30-33 without wait states. 

12.3.1 A/D Converter Interface 

A/D and D/A converters are commonly required in DSP systems and interface 
efficiently to the I/O expansion bus. These devices are available in many 
speed ranges and with a variety of features. While some might require one or 
more wait states on the I/O bus, others can be used at full speed. 

Figure 12-10 illustrates a TMS320C30 interface to an Analog Devices 
AD1678 analog-to-digital converter. The AD1678 is a 12-bit, 5-^is converter 
that allows sample rates up to 200 kHz and has an input voltage range of 10 
volts, bipolar or unipolar. The converter is connected according to manufactur- 
er's specifications to provide 0- to +1 0-volt operation. This interface illustrates 
a common approach to connecting devices such as this to the TMS320C30, 
Note that the interface requires only a minimum amount of control logic. 
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Figure 12-10. Interface to AD1678 A/D Converter 
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The AD1 678 is a very flexible converter and is configurable in a number of dif- 
ferent operating modes. These operating modes include byte or word data for- 
mat, continuous or noncontinuous conversions, enabled or disabled chip-se- 
lect function, and programmable end-of-conversion indication. This interface 
utilizes 1 2-bit word data format, rather than byte format, to be compatible with 
the TMS320C3x. Noncontinuous conversions are selected so that variable 
sample rates can be used; continuous conversions occur only at a rate of 200 
kHz. With noncontinuous conversions, the host processor determines the con- 
version rate by initiating conversions through write operations to the converter. 
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The chip-select function is enabled, so the chip-select input is required to be 
active when accessing the device. Enabling the chip select function is neces- 
sary to allow a mechanism for the AD1 678 to be isolated from other peripheral 
devices connected to the expansion bus. To establish the desired operating 
modes, the SYNC and 1 2/8 inputs to the converter are pulled high and EOCEN 
is grounded, as specified in the AD1678 data sheet. 

In this application, the converter's chip select is driven by XA12, which maps 
this device at 804000h in I/O address space. Conversions are initiated by writ- 
ing any data value to the device, and the conversion results are obtained by 
reading from the device after the conversion is completed. To generate the de- 
vice's start conversion (SC) and output enable (OE) inputs, IOSTRB is ANDed 
with XR/W. Therefore, the converter is selected whenever XA1 2 is low; OE is 
driven when reads are performed, while SC is driven when writes are per- 
formed. 

As with many A/D converters, at the end of a read cycle the AD1 678 data out- 
put lines enter a high-impedance state. This occurs after the output enable 
(OE) or read control line goes inactive. Also common with these types of de- 
vices is that the data output buffers often require a substantial amount of time 
to actually attain a full high-impedance state. When used with the 
TMS320C30-33, devices must have their outputs fully disabled no later than 
65 ns following the rising edge of IOSTRB because the TMS320C30 will begin 
driving the data bus at this point if the next cycle is a write. If this timing is not 
met, bus conflicts between the TMS320C30 and the AD 1 678 might occur, po- 
tentially causing degraded system performance and even failure due to dam- 
aged data bus drivers. The actual disable time for the AD1 678 can be as long 
as 80 ns; therefore, buffers are required to isolate the converter outputs from 
the TMS320C30. The buffers used here are 74LS244s t hat are en abled when 
the AD1678 is read and turned off 30.8 ns following IOSTRB going high. 
Therefore, the TMS320C30-33 requirement of 65 ns is met. 

When data is read following a conversion, the AD1678 takes 100 ns after its 
OE control line is asserted to provide valid data at its outputs. Thus, including 
the propagation delay of the 74LS244 buffers, the total access time for reading 
the converter is 118 ns. This requires two wait states on the TMS320C30-33 
expansion I/O bus. 

The two wait states required in this case are implemented using software wait 
states; however, depending on the overall system configuration, it might be 
necessary to implement a separate wait-state generator for the expansion bus 
(refer to subsection 12.2.2 on page 12-9). This would be the case if multiple 
devices that required different numbers of wait states were connected to the 
expansion bus. 
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Figure 12-11 shows the timing for read operations between the 
TMS320C30-33 and the AD1678. At the beginning of the cycle, the address 
and XR/W lines become valid t| = 1 0 ns followin g the fallin g edge of H 1 . Then, 
after X2 = 1 0 ns from the next rising edge of Hi , IOSTRB goes low, beginning 
the active porti on of the read cycle. After t3 = 5.8 ns (the control logic propaga- 
tion delay) , the IOR signal goes low, asserting the OE input to the AD1 678. The 
74LS244 buffers take = 30 ns to enable their outputs, and then, following 
the converters access delay and the buffer propagation delay (t 5 = 100 + 18 
= 118 ns), data is provided to the TMS320C 30. This p rovides approximately 
46 ns of data setup before the rising edge of IOSTRB. Therefore, this design 
easily satisfies the TMS320C30-33's requirement of 1 5 ns of data setup time 
for reads. 



Figure 12-11. Read Operations Timing Between the TMS320C30 andAD1678 
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Unlike the primary bus, read and write cycles on the I/O expansion bus are 
timed the same with the exception that XR/W is high for reads and low for 
writes and that the data bus is driven by the TMS320C30 during writes. When 
writing to the AD1 678, the 74LS244 buffers do not turn on and no data is trans- 
ferred. The purpose of writing to the converter is only to generate a pulse on 
the converter's SC input, which initiates a conversion cycle. When a conver- 
sion cycle is completed, the AD1 678's EOC output is used to generate an inter- 
rupt on the TMS320C30 to indicate that the converted data can be read. 

It should be noted that for different applications, use of TLC1 225 or TLC1 550 
A/D converters from Texas Instruments can be beneficial. The TLC1225 is a 
self-calibrating 12-bit-plus-sign bipolar or unipolar converter, which features 
10-iiS conversion times. The TLC1550 is a 10-bit, 6-^is converter with a high- 
speed DSP interface. Both converters are parallel-interface devices. 
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12.3.2 D/A Converter Interface 

In many DSP systems, the requirement for generating an analog output signal 
is a natural consequence of sampling an analog waveform with an A/D conver- 
ter and then processing the signal digitally internally. Interfacing D/A conver- 
ters to the TMS320C30 on the expansion I/O bus is also quite straightforward. 

As with A/D converters, D/A converters are also available in a number of vari- 
eties. One of the major distinctions between various types of D/A converters 
is whether or not the converter includes both latches to store the digital value 
to be converted to an analog quantity, and the interface to control those 
latches. With latches and control logic included with the converter, interface 
design is often simplified; however, internal latches are often included only in 
slower D/A converters. 

Because slower converters limit signal bandwidths, the converter used in this 
design was selected to allow a reasonably wide range of signal frequencies 
to be processed, and to illustrate the technique of interfacing to a converter that 
uses external data latches. 

Figure 12-12 shows an interface to an Analog Devices AD565A digital-to- 
analog converter. This device is a 12-bit, 250-ns current output DAC with an 
on-chip 10-volt reference. Using an offchip current-to-voltage conversion cir- 
cuit connected according to manufacturers specifications, the converter ex- 
hibits output signal ranges of 0 to +1 0 volts, which is compatible with the con- 
version range of the A/D converter discussed in the previous section. 
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Figure 12-12. Interface Between the TMS320C30 and the AD565A 
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Because this DAC essentially performs continuous conversions based on the 
digital value provided at its inputs, periodic sampling is maintained by periodi- 
cally updating the value stored in the external latches. Therefore, between 
sample updates, the digital value is stored and maintained at the latch outputs 
that provide the input to the DAC. This results in the analog output remaining 
stable until the next sample update is performed. 
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The external data latches used in this interface are 74LS377 devices that have 
both clock and enable inputs. These latches serve as a convenient interface 
with the TMS320C30; the enable inputs provide a device select function, and 
the clock inputs latch the data. Therefore, with the enable input driven by in- 
verted XA12 and the clock input by IOW, which is the AND of IOSTRB and 
XR/W, data will be stored in the latches when a write is performed to I/O ad- 
dress 805000h. Reading this address has no effect on the circuit. 

Figure 1 2-1 3 shows a timing diagram of a write operation to the D/A converter 
latches. 



Figure 12-13. Write Operation to the D/A Converter Timing Diagram 
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Because the write is actually being performed to the latches, the key timings 
for this operation are the timing requirements for these devices. For proper op- 
eration, these latches require simply a minimal setup and hold time of data and 
control signals with respect to the rising edge of the clock input. Specifically, 
the latches require a data setup time of 20 ns, enable setup of 25 ns, disable 
setup of 10 ns, and data and enable hold times of 5 ns. This design provides 
approximately 60 ns of enable setup, 30 ns of data setup, and 7.2 ns of data 
hold time. Therefore, the setup and hold times provided by this design are well 
in excess of those required by the latches. The key timing parameters for this 
interface are summarized in Table 12-2. 
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Table 12-2. Key Timing Parameter for D/A Converter Write Operation 



Time 




Time 


Intamol 


CVolil 


DArinHt 


ti 


HI falling to address valid 


10 ns 




XA12toXA12 delay 


5 ns 


t 3 


HI rising to IOSTRB falling 


10 ns 


u 


IOSTRB to IOW delay 


5.8 ns 


ts 


Data setup to IOW 


30 ns 


te 


Data hold from IOW 


7.2 ns 



t Timing for the TMS320C30-33 
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12 .4 System Control Functions 

Several aspects of TMS320C3x system hardware design are critical to overall 
system operation. These include such functions as clock and reset signal gen- 
eration and interrupt control. 

12.4.1 Clock Oscillator Circuitry 

You can provide an input clock to the TMS320C3x either from an external clock 
input or by using the onboard oscillator. Unless special clock requirements ex- 
ist, the onboard oscillator is generally a convenient method for clock genera- 
tion. This method requires few external components and can provide stable, 
reliable clock generation for the device. 

Figure 12-14 shows the external clock generator circuit designed to operate 
the TMS320C3x at 33.33 MHz. Since crystals with fundamental oscillation fre- 
quencies of 30 MHz and above are not readily available, a parallel-resonant 
third-overtone crystal is used with crystal frequency of 13 MHz. 



Figure 12-14. Crystal Oscillator Circuit 
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In a third-overtone oscillator, the crystal fundamental frequency must be 
attenuated so that oscillation is at the third harmonic. This is achieved with an 
LC circuit that filters out the fundamental, thus allowing oscillation at the third 
harmonic. The impedance of the LC circuit must be inductive at the crystal fun- 
damental and capacitive at the third harmonic. The impedance of the LC circuit 
is represented by 



z((o) = jcoL + 



1 

jcoC 



(3) 



Therefore, the LC circuit has a 0 at 



cop =4= (4) 
/LC 
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At frequencies significantly lower than cop* the 1/(coC) term in (3) becomes the 
dominating term, while coL can be neglected. This is expressed as 

z(co) = r-^L for co < co P 

jcoC (3) 



In (5), the LC circuit appears conductive at frequencies lower than cop On the 
other hand, at frequencies much higher than cop the coLterm is the dominant 
term in (3), and 1/(coC) can be neglected. This is expressed as 

z(co) = jcoL for co < co P 

(3) 



The LC circuit in (6) appears increasingly inductive as the frequency increases 
above cop This is shown in Figure 12-15, which is a plot of the magnitude of 
the impedance of the LC circuit of Figure 12-14 versus frequency. 

Figure 12-15. Magnitude of the Impedance of the Oscillator LC Network 
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Based on the discussion above, the design of the LC circuit proceeds as fol- 
lows: 

1) Choose the pole frequency cop slightly above the crystal fundamental. 

2) The circuit now appears inductive at the fundamental frequency and ca- 
pacitive at the third harmonic. 

In the oscillator of Figure 12-14 on page 12-27, choose fp = 13 MHz, which 
is slightly above the fundamental frequency of the crystal. Choose C = 1 5 pF. 
Then, using equation (4), L = 10 jiH. 



The reset input controls initialization of internal TMS320C3x logic and also 
causes execution of the system initialization software. For proper system ini- 
tialization, the reset signal must be applied for at least ten H1 cycles, i.e., 600 
ns for a TMS320C3x operating at 33.33 MHz. Upon power-up, however, it can 
take 20 ms or more before the system oscillator reaches a stable operating 
state. Therefore, the power-up reset circuit should generate a low pulse on the 
reset line for 1 00 to 200 ms. Once a proper reset pulse has been applied, the 
processor fetches the reset vector from location 0, which contains the address 
of the system initialization routine. Figure 12-1 6 shows a circuit that will gener- 
ate an appropriate power-up reset circuit. 



12.4.2 Reset Signal Generation 



Figure 12-16. Reset Circuit 



TMS320C3X 



RS 



+5V 




74LS14 74LS14 



<$> — ^ 



DGND 



Hardware Applications 1 2-29 



System Control Functions 



The voltage on the reset pin (RESET) is controlled by the R-| Ci network. After 
a reset, this voltage rises exponentially according to the time constant R-jCi , 
as shown in Figure 12-17. 

Figure 12-1 7. Voltage on the TMS320C30 Reset Pin 



Voltage a 




The duration of the low pulse on the reset pin is approximately t-j , which is the 
time it takes for the capacitor C-| to be charged to 1 .5 V. This is approximately 
the voltage at which the reset input switches from a logic 0 to a logic 1 . The 
capacitor voltage is expressed as 



V = V 



CC 



1-e 



(7) 



where t = Ci is the reset circuit time constant. Solving equation (7) for t re- 
sults in 



t = -R.|C.|ln 



1 - 



'CC 



(8) 



Setting the following: 
R 1 =100KQ 
Ci = 4.7 nF 
V CC = 5 V 
V = V 1 = 1.5V 

results in t = 167 ms. Therefore, the reset circuit of Figure 12-16 provides a 
low pulse of long enough duration to ensure the stabilization of the system os- 
cillator. 
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Note that if synchronization of multiple TMS320C3xs is required, all proces- 
sors should be provided with the same input clock and the same reset signal. 
After power-up, when the clock has stabilized, all processors can be synchro- 
nized by generating a falling edge on the common reset signal. Because it is 
the falling edge of reset that establishes synchronization, reset must be high 
for at least ten H1 cycles initially. Following the falling edge, reset should re- 
main low for at least ten H1 cycles and then be driven high. This sequencing 
of reset can be accomplished using additional circuitry based on either RC 
time delays or counters. 
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12.5 Serial-Port Interface 

For applications such as modems, speech, control, instrumentation, and ana- 
log interface for DSPs, a complete analog-to-digital (A/D) and digital-to-analog 
(D/A) input/output system on a single chip might be appropriate. The 
TLC32044 analog interface circuit (AIC) integrates a bandpass, switched-ca- 
pacitor, antialiasing input filter, 14-bit resolution A/D and D/A converters, and 
a low-pass, switched-capacitor, output-reconstruction filter, all on a single 
monolithic CMOS chip. The TLC32044 offers numerous combinations of mas- 
ter clock input frequencies and conversion/sampling rates, which can be 
changed via digital signal processor control. 

Four serial port modes on the TLC32044 allow direct interface to TMS320C3x 
processors. When the transmit and receive sections of the AIC are operating 
synchronously, it can interface to two SN54299 or SN74299 serial-to-parallel 
shift registers. These shift registers can then interface in parallel to the 
TMS320C30, to other TMS320 digital processors, or to external FIFO circuitry. 
Output data pulses inform the processor that data transmission is complete or 
allow the DSP to differentiate between two transmitted bytes. A flexible control 
scheme is provided so that the functions of the AIC can be selected and ad- 
justed coincidental^ with signal processing via software control. Refer to the 
TLC32044 data sheet for detailed information. 

When you interface the AIC to the TMS320C3x via one of the serial ports, no 
additional logic is required. This interface is shown in Figure 1 2-1 8. The serial 
data, control, and clock signals connect directly between the two devices, and 
the AlC's master clock input is driven from TCLKO, one of the TMS320C3x's 
internal timer outputs. The AlC's WORD/BYTE input is pulled high, selecting 
16-bit serial port transfers to optimize serial port data transfer rate. The 
TMS320C3x's XFO pin, configured as an output, is connected to the AlC's re- 
set (RST) input to allow the AIC to be reset by the TMS320C3x under program 
control. This allows the TMS320C3x timer and serial port to be initialized be- 
fore beginning conversions on the AIC. 
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Figure 12-18. AIC to TMS320C30 Interface 
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To provide the master clock input for the AIC, the TCLKO timer is configured 
to generate a clock signal with a 50% duty cycle at a frequency of f(H1)/4 or 
4.1 67 MHz. To accomplish this, the global control register for timer 0 is set to 
the value 3C1h, which establishes the desired operating modes. The period 
register for timer 0 is set to 1 , which sets the required division ratio for the H1 
clock. 

To properly communicate with the AIC, the TMS320C30 serial port must be 
configured appropriately by initializing several TMS320C30 registers and 
memory locations. First, reset the serial port by setting the serial port global 
control register to 2170300h. (The AIC should also be reset at this time. See 
description below of resetting the AIC via XFO.) This resets the serial port logic, 
configures the serial port operating modes, including data transfer lengths, 
and enables the serial port interrupts. This also configures another important 
aspect of serial port operation: polarity of serial port signals. Because active 
polarity of all serial port signals is programmable, it is critical to set appropriate- 
ly the bits in the serial port global control register that control the polarity. In this 
application, all polarities are set to positive except FSX and FSR, which are 
driven by the AIC and are true low. 

The serial port transmit and receive control registers must also be initialized 
for proper serial port operation. In this application, both of these registers are 
set to 1 1 1 h, which configures all of the serial port pins in the serial port mode, 
rather than the general-purpose digital I/O mode. 
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When the operations described above are completed, interrupts are enabled, 
and, provided that the serial port interrupt vector(s) are properly loaded, serial 
port transfers can begin after the serial port is taken out of reset. You can do 
this by loading E170300h into the serial port global control register. 

To begin conversion operations on the AIC and subsequent transfers of data 
on the serial port, first reset the AIC by setting XFO to 0 at the beginning of the 
TMS320C3x initialization routine. Set XFO to 0 by setting the TMS320C3x IOF 
register to 2. This sets the AIC to a default configuration and halts serial port 
transfers and conversion operations until reset is set high. Once the 
TMS320C3x serial port and timer have been initialized as described above, 
set XFO high by setting the IOF register to 6. This allows the AIC to begin oper- 
ating in its default configuration, which in this application is the desired mode. 
In this mode, all internal filtering is enabled, sample rate is set at approximately 
6.4 kHz, and the transmit and receive sections of the device are configured to 
operate synchronously. This mode of operation is appropriate for a variety of 
applications; if a 5.1 84-MHz master clock input is used, the default configura- 
tion results in an 8-kHz sample rate, which makes this device ideal for speech 
and telecommunications applications. 

In addition to the benefit of a convenient default operating configuration, the 
AIC can also be programmed for a wide variety of other operating configura- 
tions. Sample rates and filter characteristics can be varied, and numerous con- 
nections in the device can be configured to establish different internal architec- 
tures by enabling or disabling various functional blocks. 

To configure the AIC in a fashion different from the default state, you must first 
send the device a serial data word with the two LSBs set to 1 . The two LSBs 
of a transmitted data word are not part of the transferred data information and 
are not set to 1 during normal operation. This condition indicates that the next 
serial transmission will contain secondary control information, not data. This 
information is then used to load various internal registers and specify internal 
configuration options. Four different types of secondary control words are dis- 
tinguished by the state of the two LSBs of the transferred control information. 
Note that each transferred secondary control word must be preceded by a data 
word with the two LSBs set to 1 . 

The TMS320C3x can communicate with the AIC either synchronously or 
asynchronously, depending on the information in the control register. The op- 
erating sequence for synchronous communication with the TMS320C30 
shown in Figure 12-19 is as follows: 

1) The FSX or FSR pin is brought low. 

2) One 1 6-bit word is transmitted, or one 1 6-bit word is received. 

3) The FSX o r F SR pin is brought high. 

4) The EODX or EODR pin emits a low-going pulse. 
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Figure 12-19. Synchronous Timing of TLC32044 to TMS320C3x 
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For asynchronous communication, the operating sequence is similar, but FSX 
and FSR do not occur at the same time (see Figure 1 2-20). After each receive 
and transmit operation, the TMS320C30 asserts an internal receive (RINT) 
and transmit (XINT) interrupt, which can be used to control program execution. 

Figure 12-20. Asynchronous Timing of TLC32044 to TMS320C30 
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12.6 Low-Power-Mode Interrupt Interface 

This section explains how to generate interrupts when the IDLE2 power-down 
mode is used. 



The execution of the IDLE2 instruction causes the H1 and H3 processor clocks 
to be held at a constant level until the occurrence of an external interrupt. To 
use the TMS320C31 IDEL2 power management feature effectively, interrupts 
must be generated with or without the presence of the H1 clock. For normal 
(non-IDLE2) operation, however, the interrupt inputs must be synchronized 
with the falling edge of the H1 clock. An interrupt must satisfy the following 
conditions: 



□ It must meet the setup time on the falling edge of H1, and 

□ It must be at least one cycle and less than two cycles in duration. 

For an interrupt to be recognized during IDLE2 operation and turn the clocks 
back on, it must first be held low for one H1 cycle. The logic in Figure 12-21 
can be used to generate an interrupt signal to the TMS320C31 with the correct 
timing during non-IDLE2 and IDLE2 operation. Figure 1 2-21 shows the inter- 
rupt circuit, which uses a 1 6R4 PLD to generate the appropriate interrupt sig- 
nal. 



Figure 12-21. Interrupt Generation Circuit for Use With IDLE2 Operation 
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Example 12-1 shows the PLD equations for the 16R4 using the ABEL™ lan- 
guage. This implementation makes the following assumptions regarding the 
interrupt source: 

□ The interrupt source is at least one H1 cycle in duration. One H1 cycle is 
required to turn the H1 clock on again. 

□ The interrupt source is a low-going pulse or a falling edge. If the interrupt 
source stays active for more than one H1 cycle, it is regarded as the same 
interrupt request and not a new one. 
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Notice that the interrupt is driven active as soon as the interrupt source goes 
active. It goes inactive again on detection of two H3 rising edges. These two 
rising edges ensure that the interrupt is recognized during normal operation 
and after the end of IDLE2 operation (when the clocks turn on again). The inter- 
rupt goes inactive after the two H3 clocks are counted and does not go inactive 
again until after the interrupt source again goes inactive and returns to active. 



Example 12-1. State Machine and Equations for the Interrupt Generation 16R4 PLD 

MODULE INTERRUPT_GENERATION 

TITLE' INTERRUPT_GENERATION FOR IDLE 2 AND NON-IDLE2 TMS320C31A 
TMS320C31' 



c3xu5 



device 



'P16R4' 



"inputs 
h3 

intsrc_ 

"output 
intx 



Pin 1; 
Pin 2; 



'Interrupt source 



Pin 12; "Interrupt input signal to the TMS320C31 



sync_src_Pin 14; "Internal signal used to synchronize the 

"input to the HI clock 
same_ Pin 15; "Keeps track if the new interrupt source 

"has occurred. If active , no new interrupt 

"has occurred. 



"This logic makes the following assumptions: 
"The duration of the interrupt source is at least one HI 
"cycle in duration. It takes one HI cycle to turn the HI 
"clock on again. 

"The interrupt source is pulse- or level-triggered. If the 
"source stays active after being asserted, it is regarded 
"as the same interrupt request and not a new one. 



"Name Substitutions for Test Vectors and Equations 



c,H,L,X = .C. , ,1,0, .X. ; 

source = !intsrc_; 

sync = 1 sync__src_; 

samesrc = lsame__; 

c3xint = lintx__; 

"state bits 

outstate = [ samesrc, sync ] ; 



idle « A b00 
sync_st = A b01 
wait » A bl0 



"synchronize state 

"wait for interrupt source to go inactive 
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state_diagram outstate 



state idle: 
if 

else 



(source) then sync_st 
idle; 



state sync_st: 

if (source) then wait 

else idle; 



state wait: 
if 

else 



(source) then wait 
idle; 



equations 

i intx 

@page 



= (source w sync) & i samesrc ; 



"Test interrupt generation logic 
test vectors 



([he, 


source] ~> 


[ outstate . c3xint ] 


[ c, 


L 


] -> 


[idle, 


L 


[ L, 


H 


] -> 


[idle, 


H 


[ c, 


H 


] — > 


[ sync__st, 


H 


[ o, 


L 


] -> 


[idle, 


L 


I c, 


L 


] — > 


[ idle, 


L 


[ L, 


H 


] -> 


[idle, 


H 


[ L # 


H 


] -> 


[idle, 


H 


[ C, 


H 


] — > 


[sync_st, 


H 


[ c, 


L 


] ~> 


[ idle, 


L 


I o, 


H 


] -> 


[sync_st, 


H 


[ o, 


H 


] -> 


[wait, 


L 


I c, 


H 


] 


[wait, 


L 


[ c r 


L 


] -> 


[idle, 


L 


[ L, 


H 


] —> 


[idle, 


H 


[ L # 


H 


] -> 


[idle, 


H 


[ L # 


H 


] ~ > 


[idle, 


H 


end 




interrupt_generation 





"check start from idle 

"test normal interrupt operation 



"test coming out of idle2 operation 



"test same source 



r 'test idle2 operation 
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12.7 XDS Target Design Considerations 



12.7.1 Designing Your MPSD Emulator Connector (12-Pin Header) 

The 'C3x uses a modular port scan device (MPSD) technology to allow com- 
plete emulation via a serial scan path of the 'C3x. To communicate with the 
emulator, your target system must have a 12-pin header (2 rows of 6 pins) with 
the connections that are shown in Figure 12-22.To use the target cable, sup- 
ply the signals shown in Table 1 2-3 to a 1 2-pin header with pin 8 cut out to pro- 
vide keying. For the latest information, refer to the JTAG/MPSD Emulation 
Technical Reference (literature number SPDU079). 

Figure 12-22. 12-Pin Header Signals and Header Dimensions 
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Header Dimensions: 

Pin-to-pin spacing, 0.100 in. (X,Y) 
Pin width: 0.025-in. square post 
Pin length: 0.235-in. nominal 



t These signals should always be pulled up with separate 20-kQ resistors to Vqq. 

$ While the corresponding female position on the cable connector is plugged to prevent improper 

connection, the cable lead for pin 8 is present in the cable and is grounded as shown in the 

schematics and wiring diagrams in this document. 



Table 12-3. 12-Pin Header Signal Descriptions and Pin Numbers 



XDS510 

Signal 


Description 


'C30 

Pin Number 


'C31 

Pin Number 


EMUO 


Emulation pin 0 


F14 


124 


EMU1 


Emulation pin 1 


E15 


125 


EMU2 


Emulation pin 2 


F13 


126 


EMU3 


Emulation pin 3 


E14 


123 


H3 


'C3x H3 


A1 


82 


PD 


Presence detect. Indicates that the emulation cable is con- 
nected and that the target is powered up. PD should be tied to 
Vqc » n the target system. 



Although you can use other headers, recommended parts include: 

straight header, unshrouded DuPont Connector Systems 

part numbers: 6561 0-1 1 2 
65611-112 
37996-112 
67997-112 
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Figure 12-23 shows a portion of logic in the emulator pod. Note that 33-Q re- 
sistors have been added to the EMUO, EMU1 , and EMU2 lines; this minimizes 
cable reflections. 



Figure 12-23. Emulator Cable Pod Interface 
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12.7.2 MPSD Emulator Cable Signal Timing 

Figure 12-24 shows the signal timings for the emulator pod. Table 12-4 de- 
fines the timing parameters. The timing parameters are calculated from values 
specified in the standard data sheets for the emulator and cable pod and are 
for reference only. Texas Instruments does not test or guarantee these timings. 
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Figure 12-24. Emulator Cable Pod Timings 
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Table 1 2-4. Emulator Cable Pod Timing Parameters 



No. 



Reference 



Description 



Min 



Max Unit 



1 


*H3 min 
*H3 max 


H3 period 


35 


200 ns 


2 


*H3 high min 


H3 high pulse duration 


15 


ns 


3 


*H3 low min 


H3 low pulse duration 


15 


ns 


4 


td (EMUO, 1,2) 


EMUO, 1,2 valid from H3 low 


7 


23 ns 


5 


tsu (EMU3) 


EMU3 setup time to H3 high 


3 


ns 


6 


t hd (EMU3) 


EMU3 hold time from H3 high 


11 


ns 



12.7.3 Connections Between the Emulator and the Target System 

It is extremely important to provide high-quality signals between the emulator 
and the 'C3x on the target system. In many cases, the signal must be buffered 
to produce high quality. The need for signal buffering can be divided into three 
categories, depending on the placement of the emulation header: 

□ No signals buffered. In this situation, the distance between the emulation 
header and the 'C3x should be no more than two inches. (See 
Figure 12-25.) 
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Figure 12-25. Signals Between the Emulator and the 'C3x With No Signals Buffered 

— 2 inches or less — 
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□ Transmission signals buffered. In this situation, the distance between 
the emulation header and the 'C3x is greater than two inches but less than 
six inches. The transmission signals, H3 and EMU3, are buffered through 
the same package. (See Figure 12-26.) 

Figure 12-26. Signals Between the Emulator and the 'C3x With Transmission Signals 
Buffered 
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□ All signals buffered. The distance between the emulation header and the 
'C3x is greater than 6 inches but less than 12 inches. All 'C3x emulation 
signals, EMUO, EMU1, EMU2, EMU3, and H3, are buffered through the 
same package. (See Figure 1 2-27.) 



Figure 12-27. All Signals Buffered 
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12.7.4 Mechanical Dimensions for the 12-Pin Emulator Connector 

The 'C3x emulator target cable consists of a three-foot section of jacketed 
cable, an active cable pod, and a short section of jacketed cable that connects 
to the target system. The overall cable length is approximately three feet, ten 
inches. Figure 12-28 and Figure 12-29 show the mechanical dimensions for 
the target cable pod and short cable. Note that the pin-to-pin spacing on the 
connector is 0.100 inches in both the X and Y planes. The cable pod box is 
nonconductive plastic with four recessed metal screws. 
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Figure 12-28. Pod/Connector Dimensions 




Refer to Figure 12-29. 
Note: All dimensions are in inches and are nominal unless otherwise specified. 
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Figure 12-29. 12-Pin Connector Dimensions 
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Note: All dimensions are in inches and are nominal unless otherwise specified. 



12.7.5 Diagnostic Applications 



For system diagnostics applications, or to embed emulation compatibility on 
your target system, you can connect a 'C3x device directly to a Tl ACT8990 
test bus controller (TBC) as shown in Figure 12-30. The TBC is described in 
the Texas I nstruments Advanced Logic and Bus Interface Logic Data Book (lit- 
erature number SCYD001). A TBC can connect to only one 'C3x device. 
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Figure 12-30. TBC Emulation Connections for 'C3x Scan Paths 
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Notes: 



1) In a 'C3x design, the TBC can connect to only one 'C3x device. 

2) The 'C3x device's H1 clock drives TCKI on the TBC. This is different from the 
emulation header connections where H3 is used. 
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This chapter covers the TMS320C3x pinouts, signal descriptions, and 



electrical characteristics. 

Major topics discussed in this chapter are as follows: 

Topic Page 

13.1 Pinout and Pin Assignments 13-2 

13.2 Signal Descriptions . 13-16 

13.3 Electricai Specifications 13-25 

13.4 Signal Transition Levels 1 3-29 

13.5 Timing .......13-30 
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13.1 Pinout and Pin Assignments 

13.1.1 TMS320C30 Pinouts and Pin Assignments 

The TMS320C30 digital signal processor is available in a 181 -pin grid array 
(PGA) package. Figure 13-1 and Figure 13-2 show the pinout for this pack- 
age. Figure 13-3 shows the mechanical layout. Table 13-1 shows the 
associated pin assignments alphabetically; Table 13-2 shows the pin assign- 
ments numerically. 
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Figure 13-1. TMS320C30 Pinout (Top View) 
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Figure 13-2. TMS320C30 Pinout (Bottom View) 
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Figure 13-3. TMS320C30 181-Pin PGA Dimensions— GEL Package 




Thermal Resistance Characteristics 
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(181 Places) 



1.52 (0.060) 
1.02 (0.040) 
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® View 
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All linear dimensions are in millimeters and parenthetically in inches. 
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Table 13-1. TMS320C30-PGA Pin Assignments (Alphabetical^ 



Signal 


Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


Pin 


AO 


F15 


D8 


B5 


EMU8 


F14 


Vbbp 


D3 


XD15 


R10 


A1 


G12 


D9 


C6 


FSRO 


P3 


v D d 


D8 


XD16 


M9 


A2 


G13 


D10 


A5 


FSR1 


M3 


v D d 


H4 


XD17 


P10 


A3 


G14 


D11 


B6 


FSXO 


R2 


v D d 


H12 


XD18 


R11 


A4 


G15 


D12 


D7 


FSX1 


P1 


v D d 


M8 


XD19 


N10 


A5 


H15 


D13 


A6 


H1 


B3 


v S s 


C8 


XD20 


P11 


A6 


H14 


D14 


C7 


H3 


A1 


v S s 


H3 


XD21 


R12 


A7 


J15 


D15 


B7 


HOLD 


F3 


v S s 


H13 


XD22 


M10 


A8 


J14 


D16 


A7 


HOLDA 


E2 


v S s 


N8 


XD23 


N11 


A9 


J13 


D17 


A8 


IACK 


G1 


V SUBS 


E4 


XD24 


P12 


A10 


K15 


D18 


B8 


INTO 


H2 


X1 


C2 


XD25 
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D19 
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t ADVqd. CV$s. ddv DD> DVss. IODVdd. IVss. MDVqd. PDVqd. v DD. and V SS P ins are on a common plane Internal to the 
device. 
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Table 13-2. TMS320C30-PGA Pin Assignments (Numerical)^ 



Signal 
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Signal 


Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


Pin 
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XA6 
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N14 






EMU5 


C1 


RESET 
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N15 
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STRB 
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P1 
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C3 
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DX1 


P2 
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L2 


FSRO 


P3 






D5 
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F12 


RSV9 


L3 
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P4 






D9 


C6 


EMU2 


F13 
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L4 


XD1 


P5 






D14 


C7 


EMU8 


F14 
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L8 


XD4 


P6 






v S s 


C8 
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F15 


A22 


L12 


XD8 


P7 






D21 


C9 


IACK 


G1 


A18 


L13 


XD10 


P8 






D26 
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L14 


XD13 


P9 







t ADVqd. CVss, DDVdd, DVss. IODVdd. IV$$, MDVqd. PDVqd, Vqd. ar, d Vss P'ns are on a common plane internal to the 
device. 
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13.1.2 TMS320C30 PPM Pinouts and Pin Assignments 

The TMS320C30 PPM device is packaged in a 208-pin plastic quad flat pack 
(PQFP) JDEC standard package. Figure 1 3-4 shows the pinouts for this pack- 
age, and Figure 1 3-5 shows the mechanical layout. Table 1 3-3 shows the as- 
sociated pin assignments alphabetically; Table 13-4 shows the assignments 
numerically. 



Figure 13-4. TMS320C30 PPM Pinout (Top View) 
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Figure 13-5. TMS320C30 PPM 208-Pin Plastic Quad Flat Pack—PQL Package 



156 



157^ 



208<= 



T 



30,7 (1.209) 
30,5 (1.201) 



SQ" 



28,1 (1.106) 
27,9 (1.098) ou 



105 



'104 



0,28(0.0110 2) 
J£" 0,18(1.00709) 



T 



0,50 (0.01968) TYP 




0,20 (0.008) 
0,12(0.005) 



0,25 (0.001) MIN 
Seating Plane 



Jl 

4,20 (0.165) MAX 



0,60 (0.024) 
0,40(0.016) 



7 



0°-5° 



404001 6/A-1 0/93 



Notes: 1) All linear dimensions are in millimeters and parenthetically in inches. 

2) This drawing is subject to change without notice. 

3) Contact a field sales office to determine if a tighter coplanarity requirement is available for this package. 
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Table 13-3. TMS320C30-PPM Pin Assignments (Alphabetical)^ 



Signal 


Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


Pin 


AO 


139 


D6 


197 


EMUO 


140 


RSV3 


37 


XA10 


148 


A1 


138 


D7 


196 


EMU1 


141 


RSV4 


38 


XA11 


147 


A2 


137 


D8 


195 


EMU2 


142 


RSV5 


39 


XA12 


146 


A3 


136 


D9 


194 


EMU3 


143 


RSV6 


40 


XDO 


64 


A4 


135 


D10 


193 


EMU4/SHZ 


144 


RSV7 


41 


XD1 


65 


A5 


134 


D11 


192 


EMU5 


9 


RSV8 


42 


XD2 


66 


A6 


129 


D12 


191 


EMU6 


63 


RSV9 


43 


XD3 


69 


A7 


128 


D13 


190 


FSRO 


56 


RSV10 


44 


XD4 


70 


A8 


127 


D14 


189 


FSR1 


46 


R/W 


20 


XD5 


71 


A9 


126 


D15 


188 


FSXO 


59 


STRB 


19 


XD6 


72 


A10 


125 


D16 


187 


FSX1 


49 


TCLKO 


61 


XD7 


73 


A11 


124 


D17 


186 


H1 


204 


TCLK1 


62 


XD8 


74 


A12 


123 


D18 


180 


H3 


205 


V BBP 


8 


XD9 


75 


A13 


122 


D19 


179 


HOLD 


15 


V D D 


26 


XD10 


76 


A14 


119 


D20 


178 


HOLDA 


14 


v D d 


27 


XD11 


82 


A15 


118 


D21 


177 


IACK 


24 


v D d 


77 


XD12 


83 


A16 


117 


D22 


176 


INTO 


25 


v D d 


78 


XD13 


84 


A17 


116 


D23 


175 


INT1 


31 


v D d 


130 


XD14 


85 


A18 


115 


D24 


174 


INT2 


32 


v D d 


131 


XD15 


86 


A19 


114 


D25 


173 


INT3 


33 


v D d 


181 


XD16 


87 


A20 


113 


D26 


170 


IODV DD 


67 


v D d 


182 


XD17 


88 


A21 


112 


D27 


169 


IODV DD 


68 


v S s 


29 


XD18 


89 


A22 


111 


D28 


168 


IODV DD 


102 


v S s 


30 


XD19 


90 


A23 


110 


D29 


167 


IODV DD 


103 


v S s 


80 


XD20 


91 


ADV DD 


120 


D30 


166 


ivss 


153 


v S s 


81 


XD21 


92 


ADV DD 


121 


D31 


165 


!V SS 


154 


v S s 


132 


XD22 


93 


ADV DD 


157 


DDV DD 


171 


MC/MP 


145 


v S s 


133 


XD23 


94 


ADV DD 


158 


DDV DD 


172 


MDV DD 


16 


v S s 


184 


XD24 


95 


CLKRO 


57 


DDV DD 


206 


MDV DD 


17 


v S s 


185 


XD25 


96 


CLKR1 


47 


DDV DD 


207 


MSTRB 


11 


V SUBS 


7 


XD26 


97 


CLKXO 


58 


DRO 


55 


NC 


28 


X1 


6 


XD27 


98 


CLKX1 


48 


DR1 


45 


NC 


79 


X2/CLKIN 


5 


XD28 


99 


cv ss 


3 


DV SS 


1 


NC 


104 


XAO 


164 


XD29 


100 


cv ss 


4 


DV SS 


2 


NC 


183 


XA1 


163 


XD30 


101 


cv ss 


107 


DV SS 


51 


NC 


208 


XA2 


162 


XD31 


109 


cv ss 


108 


DV SS 


52 


PDV DD 


53 


XA3 


161 


XFO 


23 


DO 


203 


DV SS 


105 


PDV DD 


54 


XA4 


160 


XF1 


22 


D1 


202 


DV SS 


106 


RDY 


18 


XA5 


159 


XRDY 


10 


D2 


201 


DV SS 


155 


RESET 


21 


XA6 


152 


XR/W 


13 


D3 


200 


DV SS 


156 


RSVO 


34 


XA7 


151 


XSTRB 


12 


D4 


199 


DXO 


60 


RSV1 


35 


XA8 


150 






D5 


198 


DX1 


50 


RSV2 


36 


XA9 


149 







t ADVdd.CVss. DDVdd. DV$s, IODN/qd. IVss. MDVdd. PDVdd, Vdd> and V SS P' ns are on a common plane internal to the 
device. 
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Table 13-4. TMS320C30-PPM Pin Assignments (Numerical)^ 



Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


1 


DV SS 


43 


RSV9 


85 


XD14 


127 


A8 


169 


D27 


2 


DV SS 


44 


RSV10 


86 


XD15 


128 


A7 


170 


D26 


3 


cv ss 


45 


DR1 


87 


XD16 


129 


A6 


171 


DDV DD 


4 


cv ss 


46 


FSR1 


88 


XD17 


130 


V D D 


172 


DDV DD 


5 


X2 


47 


CLKR1 


89 


XD18 


131 


v D d 


173 


D25 


6 


X1 


48 


CLKX1 


90 


XD19 


132 


v S s 


174 


D24 


7 


V SUBS 


49 


FSX1 


91 


XD20 


133 


v S s 


175 


D23 


8 


V BBP 


50 


DX1 


92 


XD21 


134 


A5 


176 


D22 


9 


EMU5 


51 


DV SS 


93 


XD22 


135 


A4 


177 


D21 


10 


XRDY 


52 


DV SS 


94 


XD23 


136 


A3 


178 


D20 


11 


MSTRB 


53 


PDV DD 


95 


XD24 


137 


A2 


179 


D19 


12 


XSTRB 


54 


PDV DD 


96 


XD25 


138 


A1 


180 


D18 


13 


XR/W 


55 


DRO 


97 


XD26 


139 


AO 


181 


V D D 


14 


HOLDA 


56 


FSRO 


98 


XD27 


140 


EMUO 


182 


v D d 


15 


HOLD 


57 


CLKRO 


99 


XD28 


141 


EMU1 


183 


NC 


16 


MDV DD 


58 


CLKXO 


100 


XD29 


142 


EMU2 


184 


v S s 


17 


MDV DD 


59 


FSXO 


101 


XD30 


143 


EMU3 


185 


v S s 


18 


RDY 


60 


DXO 


102 


lODV DD 


144 


EMU4/SHZ 


186 


D17 


19 


STRB 


61 


TCLKO 


103 


IODV DD 


145 


MC/MP 


187 


D16 


20 


R/W 


62 


TCLK1 


104 


NC 


146 


XA12 


188 


D15 


21 


RESET 


63 


EMU6 


105 


DV SS 


147 


XA11 


189 


D14 


22 


XF1 


64 


XDO 


106 


DV SS 


148 


XA10 


190 


D13 


23 


XFO 


65 


XD1 


107 


cv ss 


149 


XA9 


191 


D12 


24 


IACK 


66 


XD2 


108 


cv ss 


150 


XA8 


192 


D11 


25 


INTO 


67 


IODV DD 


109 


XD31 


151 


XA7 


193 


D10 


26 


V D D 


68 


iodv dd 


110 


A23 


152 


XA6 


194 


D9 


27 


v D d 


69 


XD3 


111 


A22 


153 


iv ss 


195 


D8 


28 


NC 


70 


XD4 


112 


A21 


154 


ivss 


196 


D7 


29 


v S s 


71 


XD5 


113 


A20 


155 


DV SS 


197 


D6 


30 


v S s 


72 


XD6 


114 


A19 


156 


DV SS 


198 


D5 


31 


INT1 


73 


XD7 


115 


A18 


157 


ADVqd 


199 


D4 


32 


INT2 


74 


XD8 


116 


A17 


158 


ADV DD 


200 


D3 


33 


INT3 


75 


XD9 


117 


A16 


159 


XA5 


201 


D2 


34 


RSVO 


76 


XD10 


118 


A15 


160 


XA4 


202 


D1 


35 


RSV1 


77 


V D D 


119 


A14 


161 


XA3 


203 


DO 


36 


RSV2 


78 


v D d 


120 


ADV DD 


162 


XA2 


204 


H1 


37 


RSV3 


79 


NC 


121 


ADV DD 


163 


XA1 


205 


H3 


38 


RSV4 


80 


v S s 


122 


A13 


164 


XAO 


206 


DDV DD 


39 


RSV5 


81 


v S s 


123 


A12 


165 


D31 


207 


DDV DD 


40 


RSV6 


82 


XD11 


124 


A11 


166 


D30 


208 


NC 


41 


RSV7 


83 


XD12 


125 


A10 


167 


D29 






42 


RSV8 


84 


XD13 


126 


A9 


168 


D28 







t ADVqd. CVss. DDN/qd. DV$s. IODVdd. ,v SS» MDVqd, PDVdd. v DD« and v SSP insareonacommon plane internal to the 
device. 
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13.1.3 TMS320C31 Pinouts and Pin Assignments 

The TMS320C31 device is packaged in a 132-pin plastic quad flat pack 
(PQFP) JDEC standard package. Figure 13-6 shows the pinouts for this pack- 
age, and Figure 1 3-7 shows the mechanical layout. Table 1 3-5 shows the as- 
sociated pin assignments alphabetically; Table 13-6 shows the pin assign- 
ments numerically. 

Figure 13-6. TMS320C31 Pinout (Top View) 



J N r O (O C S 

><><<<<<<<<><>><<>><<>5WLUWUJH>h->|C0> 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 132 131 130 129 128 127 126 125 124 123 122 121 120 119 118 117 



A9[; 


18 


• 


116 


1 oxo 


v*SS L 


19 




115 




A8[ 


20 




114 


2 FSXO 


A7[ 


21 




113 


]v S s 


A6[ 


22 




112 


] CLKXO 


A5Q 


23 




111 


] CLKRO 


VddC 


24 




110 


] FSR0 


A4Q 


25 




109 


]V S S 


A3Q 


26 




108 


] DR0 


A2[ 


27 




107 


] INT3 


A1[ 


28 




106 


] INT2 


AOQ 


29 




105 


DV DD 


vssll 


30 




104 


:v DD 


D31Q 


31 




103 


] INT1 


VddC 


32 




102 


DVss 


VddC 


33 




101 


:v ss 


D30[] 


34 




100 


2 INTO 


vssC 


35 




99 


2 IACK 


vssC 


36 




98 


]XF1 


vssC 


37 




97 


]V DD 


D29Q 


38 




96 


]XF0 


D28C 


39 




95 


] RESET 


VddC 


40 




94 


2 R/W 


D27£ 


41 




93 


2 STRB 


vssC 


42 




92 


] RDY 


D26 C 


43 




91 


DV DD 


D25C 


44 




90 


2 HOLD 


D24C 


45 




89 


2 HOLDA 


D23C 


46 




88 


]X1 


D22[ 


47 




87 


2 X2/CLKIN 


D21[ 


48 




86 


:v ss 


VddC 


49 




85 


DV SS 


D20C 


50 




84 


]v ss 



51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 

uuuuuuuuuuuuuuuuuuuuuuuuuuuuuuuuu 
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Figure 13-7. TMS320C31 132-Pin Plastic Quad Flat Pack—PQL Package 



0,254(0.010) Norn 
0,635 (0.025) Norn -#| |#- 



T 



4,45 (0.175) 
4,19 (0.165) 



+i — M 



0,76 (0.030) Norn 
i 



n 

24,18 (0.952) = 
24,08 (0.948) = 



27,56 (1.085) 
27,31 (1.075) 



24,18(0.952) 
24,08 (0.948) 

27,56 (1.085) 



27,31 (1.075) 
Thermal Resistance Characteristics 



Parameter 


°C/W 


Air Flow 
LFPM 


R0JC 


11.0 


N/A 


R0JA 


49.0 


0 


R 0JA 


35.5 


200 


R0JA 


28.0 


400 


R0JA 


23.5 


600 


R0JA 


21.6 


800 


R0JA 


20.0 


1000 



All linear dimensions are in millimeters and parenthetically in inches. 
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Table 13-5.TMS320C31 Pin Assignments (Alphabetical)* 



Signal 


Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


Pin 


AO 


29 


D4 


76 


EMUO 


124 


Vdd 


40 


v S s 


84 


A1 


28 


D5 


75 


EMU1 


125 


Vdd 


49 


v S s 


85 


A2 


27 


D6 


74 


EMU2 


126 


Vdd 


59 


v S s 


86 


A3 


26 


D7 


73 


EMU3 


123 


Vdd 


65 


v S s 


101 


A4 


25 


D8 


68 


FSRO 


110 


Vdd 


66 


v S s 


102 


A5 


24 


D9 


67 


FSXO 


114 


Vdd 


74 


v S s 


109 


A6 


23 


D10 


64 


H1 


81 


Vdd 


83 


v S s 


113 


A7 


22 


D11 


63 


H3 


82 


Vdd 


91 


v S s 


117 


A8 


21 


D12 


62 


HOLD 


90 


v D d 


97 


Vss 


119 


A9 


20 


D13 


60 


HOLDA 


89 


v D d 


104 


v S s 


128 


A10 


19 


D14 


58 


IACK 


99 


Vdd 


105 


X1 


88 


All 


18 


D15 


56 


INTO 


100 


Vdd 


115 


X2/CLKIN 


87 


A12 


17 


D16 


55 


INT1 


103 


v D d 


121 


XFO 


96 


A13 


16 


D17 


54 


INT2 


106 


Vdd 


131 


XF1 


98 


A14 


15 


D18 


53 


INT3 


107 


Vdd 


132 






A15 


14 


D19 


52 


MCBIVMP 


127 


Vss 


3 






A16 


13 


D20 


50 


RDY 


92 


v S s 


4 






A17 


12 


D21 


48 


RESET 


95 


v S s 


17 






A18 


11 


D22 


47 


R/W 


94 


Vss 


19 






A19 


10 


D23 


46 


SHZ 


118 


v S s 


30 






A20 


9 


D24 


45 


STRB 


93 


v S s 


35 






A21 


8 


D25 


44 


TCLKO 


120 


v S s 


36 






A22 


7 


D26 


43 


TCLK1 


122 


v S s 


37 






A23 


6 


D27 


41 






v S s 


42 






CLKRO 


5 


D28 


39 






v S s 


51 






CLKXO 


4 


D29 


38 


V D D 


6 


Vss 


57 






DO 


3 


D30 


34 


v D d 


15 


v S s 


61 






D1 


2 


D31 


31 


v D d 


24 


v S s 


69 






D2 


1 


DRO 


108 


v D d 


32 


v S s 


70 






D3 


130 


DXO 


116 


Vdd 


33 


v S s 


71 







"''Vdd and V SS P ins are on a common plane internal to the device. 
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Table 13-6.TMS320C31 Pin Assignments (Numerical)* 



Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


1 


A21 


31 


D31 


61 


v S s 


91 


v DD 


121 


v D d 


2 


A20 


32 


V D D 


62 


D12 


92 


RDY 


122 


TCLK1 


3 


v S s 


33 


v D d 


63 


D11 


93 


STRB 


123 


EMU3 


4 


v S s 


34 


D30 


64 


D10 


94 


R/W 


124 


EMUO 


5 


A19 


35 


v S s 


65 


V D D 


95 


RESET 


125 


EMU1 


6 


V D D 


36 


v S s 


66 


v D d 


96 


XFO 


126 


EMU2 


7 


A18 


37 


v S s 


67 


D9 


97 


V D D 


127 


MCBL/MP 


8 


A17 


38 


D29 


68 


D8 


98 


XF1 


128 


v S s 


9 


A16 


39 


D28 


69 


v S s 


99 


IACK 


129 


A23 


10 


A15 


40 


V D D 


70 


v S s 


100 


INTO 


130 


A22 


11 


A14 


41 


D27 


71 


v S s 


101 


v S s 


131 


V D D 


12 


A13 


42 


v S s 


72 


D7 


102 


v S s 


132 


v D d 


13 


A12 


43 


D26 


73 


D6 


103 


INT1 






14 


A11 


44 


D25 


74 


V D D 


104 


V D D 






15 


V D D 


45 


D24 


75 


D5 


105 


v D d 






16 


A10 


46 


D23 


76 


D4 


106 


INT2 






17 


v S s 


47 


D22 


77 


D3 


107 


INT3 






18 


A9 


48 


D21 


78 


D2 


108 


DRO 






19 


v S s 


49 


V D D 


79 


D1 


109 


v S s 






20 


A8 


50 


D20 


80 


DO 


110 


FSRO 






21 


A7 


51 


v S s 


81 


H1 


111 


CLKRO 






22 


A6 


52 


D19 


82 


H3 


112 


CLKXO 






23 


A5 


53 


D18 


83 


V D D 


113 


v S s 






24 


V D D 


54 


D17 


84 


v S s 


114 


FSXO 






25 


A4 


55 


D16 


85 


v S s 


115 


V D D 






26 


A3 


56 


D15 


86 


v S s 


116 


DXO 






27 


A2 


57 


v S s 


87 


X2/CLKIN 


117 


v S s 






28 


A1 


58 


D14 


88 


X1 


118 


SHZ 






29 


AO 


59 


V D D 


89 


HOLDA 


119 


v S s 






30 


v S s 


60 


D13 


90 


HOLD 


120 


TCLKO 







t Vqd ano " Vss P' ns are on a common plane internal to the device. 
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13.2 Signal Descriptions 

13.2.1 TMS320C30 Signal Descriptions 

Table 13-7 describes the signals that the TMS320C30 device uses in the 
microprocessor mode. It lists the signal/port/bit name; the number of pins allo- 
cated; the input (I), output (0), or high-impedance state (Z) operating modes; 
a brief description of the signal's function; and the condition that place s an out - 
put pin in high impedance. A line over a signal name (for example, RESET) 
indicates that the signal is active (low) (true at a logic 0 level). Pins labeled NC 
are not to be connected by the user. The signals are grouped according to 
function. 
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Table 13-7. TMS320C30 Signal Descriptions 



Signal/Port 


#Pins 


l/O/Zt 


Description 


Condition When 
Signai Is in High Z* 


Primary Bus Interface (61 Pins) 


D31-D0 


32 


l/O/Z 


32-bit data port of the primary bus interface 


S 


H R 


A23-A0 


24 


o/z 


24-bit address port of the primary bus inter- 
face 


S 


H R 


R/W 


1 


o/z 


Read/write signal for primary bus interface. This 
pin is high when a read is performed and low 
when a write is performed over the parallel inter- 
face. 


S 


H R 


STRB 


1 


o/z 


External access strobe for the primary bus 
interface 


S 


H 


RDY 


1 


I 


Ready signal. This pin indicates that the exter- 
nal device is prepared for a primary bus inter- 
face transaction to complete. 


S 




HOLD 


1 


I 


Hold signal for primary bus interface. When 
HOLD is a logic low, any ongoing transaction is 
completed. The A23-A0, D31-D0, STRB, and 
R/W signals are placed in a high-impedance 
state, and all transactions over the primary bus 
interface are held until HOLD becomes a logic 
high or the NOHOLD bit of the primary bus con- 
trol register is set. 






HOLDA 


1 


o/z 


Hold acknowledge signal for primary bus inter- 
face. This signal is generated in response to a 
logic low on HOLD. It signals that A23-A0, D31- 
DO, STRB, and R/W are placed in a high-impe- 
dance state and that all transactions over the 
bus will be held. HOLDA will be high in response 
to a logic high of HOLD or when the NOHOLD 
bit of the primary bus control register is set. 


S 




Expansion Bus Interface (49 Pins) 


XD31-XD0 


32 


l/O/Z 


32-bit data port of the expansion bus interface 


s 


R 


XA12-XA0 


13 


o/z 


13-bit address port of the expansion bus inter- 
face 


s 


R 


XR/W 


1 


o/z 


Read/write signal for expansion bus interface. 
When a read is performed, this pin is held high; 
when a write is performed, this pin is low. 


s 


R 


MSTRB 


1 


o/z 


External memory access strobe for the expan- 
sion bus interface 


s 





t Input (I), output (O), high-im pedance st ate (Z) 

* S = SHZ active, H = HOLD active, R = RESET active 
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Table 13-7. TMS320C30 Signal Descriptions (Continued) 



Signal/Port 


#Pins 


l/O/Zt 


Description 


Condition When 
Signal Is in High Z* 


Expansion Bus Interface (49 Pins) (Continued) 


IOSTRB 


1 


o/z 


External I/O access strobe for expansion bus 
interface 


S 




XRDY 


1 


I 


Ready signal. This pin indicates that the exter- 
nal device is prepared for an expansion bus in- 
terface transaction to complete. 






Control Signals (9 Pins) 


RESET 


1 


I 


Reset. When this pin is a logic low, the device is 
placed in the reset condition. After reset be- 
comes a logic high, execution begins from the 
location specified by the reset vector. 






INT3-INT0 


4 


I 


External interrupts 






IACK 


1 


o/z 


Interrupt acknowledge signal. IACK is set to 1 
(logic high) by the IACK instruction. This can be 
used to indicate the beginning or end of an in- 
terrupt service routine. 


S 




MC/MP 


1 


I 


Microcomputer/microprocessor mode pin 






XF1.XF0 


2 


l/O/Z 


External flag pins. They are used as general- 
purpose I/O pins or to support interlocked pro- 
cessor instructions. 


S 


R 


Serial Port 0 Signals (6 Pins) 


CLKXO 


1 


l/O/Z 


Serial port 0 transmit clock. Serves as the serial 
shift clock for the serial port 0 transmitter. 


S 


R 


DXO 


1 


l/O/Z 


Data transmit output. Serial port 0 transmits se- 
rial data on this pin. 


S 


R 


FSXO 


1 


l/O/Z 


Frame synchronization pulse for transmit. The 
FSXO pulse initiates the transmit data process 
over pin DXO. 


S 


R 


CLKRO 


1 


l/O/Z 


Serial port 0 receive clock. Serves as the serial 
shift clock for the serial port 0 receiver. 


S 


R 


DRO 


1 


l/O/Z 


Data receive. Serial port 0 receives serial data 
via the DRO pin. 


S 


R 


FSRO 


1 


l/O/Z 


Frame synchronization pulse for receive. The 
FSRO pulse initiates the receive data process 
over DRO. 


S 


R 



t Input (I), output (O), high-impedance state (Z) 

* S = SHZ active, H = HOLD active, R = RESET active 
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Table 13-7. TMS320C30 Signal Descriptions (Continued) 



Signal/Port 


#Pins 


l/O/Zt 


Description 


Condition When 
Signal Is in High Z* 








Serial Port 1 Signals (6 Pins) 






CLKX1 


1 


l/O/Z 


Serial port 1 transmit clock. Serves as the seri- 
al shift clock for the serial port 1 transmitter. 


S 


R 


DX1 


1 


l/O/Z 


Data transmit output. Serial port 1 transmits 
serial data on this pin. 


S 


R 


FSX1 


1 


l/O/Z 


Frame synchronization pulse for transmit. The 
FSX1 pulse initiates the transmit data process 
over pin DX1 . 


S 


R 


CLKR1 


1 


l/O/Z 


Serial port 1 receive clock. Serves as serial 
shift clock for the serial port 1 receiver. 


S 


R 


DR1 


1 


l/O/Z 


Data receive. Serial port 1 receives serial data 
via the DR1 pin. 


S 


R 


FSR1 


1 


l/O/Z 


Frame synchronization pulse for receive. The 
FSR1 pulse initiates the receive data process 
over DR1 . 


S 


R 


Timer 0 Signals (1 Pin) 


TCLKO 


1 


l/O/Z 


Timer clock. As input, TCLKO is used by timer 0 
to count external pulses. As output pin, TCLKO 
outputs pulses generated by timer 0. 


S 


R 


Timer 1 Signals (1 Pin) 


TCLK1 


1 


l/O/Z 


Timer clock. As input, TCLK1 is used by timer 1 
to count external pulses. As output pin, TCLK1 
outputs pulses generated by timer 1 . 


S 


R 


Supply and Oscillator Signals (29 Pins) 


V DD3~ V DD0 


4 


I 


Four +5-V supply pins § 






IODV DD1 ,IODV DD0 


2 


I 


Two +5-V supply pins § 






ADV DD1 ,ADV DD0 


2 


I 


Two +5-V supply pins § 






PDV DD 


1 


I 


One +5-V supply pin § 







t Input (I), output (O), hi gh-im pedance st ate (Z) 

* S = SHZ active, H = HOLD active, R = RESET active 

§ The recommended decoupling capacitor is 0.1 piF. 
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Table 13-7. TMS320C30 Signal Descriptions (Continued) 



Signal/Port 


#Pins 


l/O/Zt 


Description 


Condition When 
Signal Is in High Z* 


Supply and Oscillator Signals (29 Pins) (Continued) 


DDV DD1> DDV DD0 


2 


1 


Two +5-V supply pins § 




MDV DD 


1 


1 


One +5-V supply pin § 




V SS3- V SS0 


4 




Four ground pins 




DV SS3 _DV SS0 


4 


1 


Four ground pins 




CVssi.CVsso 


2 




Two ground pins 




"Vss 


1 


' 


One ground pin 




V BBP 


1 


NC 


Vbb pump oscillator output 




V SUBS 


i 


i 


Substrate pin. Tie to ground. 




X1 


1 


0 


Output pin from internal oscillator for the crystal. 
If crystal not used, pin should be left uncon- 
nected. 




X2/CLKIN 


1 


1 


Input pin to internal oscillator from a crystal or a 
clock 




H1 


1 


o/z 


External H1 clock — has a period equal to twice 
CLKIN. 


S 


H3 


1 


o/z 


External H3 clock— has a period equal to twice 
CLKIN. 


S 



t Input (I), output (O), high-im pedance st ate (Z) 
* S = SHZ active, H = HOLD active, R = RESET active 

§ Follow the connections specified for the reserved pins. 1 8- to 22-kQ pull-up resistors are recommended. All +5-volt supply pins 
must be connected to a common supply plane, and all ground pins must be connected to a common ground plane. 
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Table 13-7.TMS320C30 Signal Descriptions (Continued) 



Signal/Port 


#Pins 


l/O/Zt 


Description 


Condition When 
Signal Is in High Z* 


Reserved (18 Pins) § 


EMU2-EMU0 


3 


I 


Reserved. Use pull-ups to +5 volts. See Sec- 

t!r\n 19 7 r\n nana 1 0_QQ 
llUfl \c..l UN pdyc l^'Oa. 




EMU3 




o 


Ppcprv/pH Spp Spptinn 19 7 nn nanp 1 9-^Q 




EMU4/SHZ 




I 


Shutdown hinh imnpdanre An active low shuts 

V-/I lUlUUVVI 1 I IIUI 1 II 1 IUCUCU Iv/w. #VI 1 HwUVw Iwif wild Iw 

down the TMS320C30 and places all pins in a 
high-impedance state. This signal is used for 
board-level testing to ensure that no dual drive 
conditions occur. CAUTION: An active low on 
the SHZ pin corrupts TMS320C30 memory and 
register contents. Reset the device with an 
SHZ=1 to restore it to a known operating condi- 
tion. 




EMU6, EMU5 


2 


NC 


Reserved. 




RSV1 0-RSV5 


6 


I/O 


Reserved. Use pull-ups on each pin to +5 volts. 




RSV4~-RSV0 


5 


I 


Reserved. Tie pins directly to +5 volts. 




Locator (1 Pin) 


Locator 


1 


NC 


Reserved. See Figure 13-1 on page 13-3 and 
Table 13-1 on page 13-6. 





t Input (I), output (0), high-im pedance st ate (Z) 

* S = SHZ active, H = HOLD active, R = RESET active 

§ Follow the connections specified for the reserved pins. 18- to 22-kQ pull-up resistors are recommended. All +5-volt supply pins 
must be connected to a common supply plane, and all ground pins must be connected to a common ground plane. 
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13.2.2 TMS320C31 Signal Descriptions 

Table 13-8 describes the signals that the TMS320C31 device uses in the 
microprocessor mode. They are listed according to the signal name; the num- 
ber of pins allocated; the input (I), output (O), or high-impedance state (Z) op- 
erating modes; a brief description of the signal's function; and the condition 
that places an output pin in high impedance. A line over a signal name (for ex- 
ample, RESET) indicates that the signal is active (low) (true at a logic 0 level). 



Table 13-8.TMS320C31 Signal Descriptions 



Signal/Port 


#Pins 


l/O/Zt 


Description 


Condition When 
Signal Is in High Z* 


Primary Bus Interface (61 Pins) 


D31-D0 


32 


l/O/Z 


32-bit data port 


S 


H R 


A23— AO 


24 






o 


if d 

n H 


HOLD 


1 


I 


Hold signal. When HOLD is a logic low, any on- 
going transaction is completed. The A23-A0, 
uoi-uu, o i riD, ana ri/vv signals are piacea in 
a high-impedance state, and all transactions 
over the primary bus interface are held until 
HOLD becomes a logic high or until the NO- 
HOLD bit of the primary bus control register is 
set. 






HOLDA 


1 


o/z 


Hold acknowledge signal. This signal is gener- 
ated in response to a logic low on HOLD. It sig- 
nals that A23-A0, D31-D0, STRB, and R/W are 
placed in a high-impedance state and that all 
transactions over the bus will be held. HOLDA 
will be high in response to a logic high of HOLD 
or until the NOHOLD bit of the primary bus con- 
trol register is set. 


s 




R/W 


1 


o/z 


Read/write signal. This pin is high when a read 
is performed and low when a write is performed 
over the parallel interface. 


s 


H R 


RDY 


1 


I 


Ready signal. This pin indicates that the exter- 
nal device is prepared for a transaction comple- 
tion. 






STRB 


1 


o/z 


External access strobe 


s 


H 



t Input (I), output (O), high-im pedance ( Z) state 

* S = SHZ active, H = HOLD active, R = RESET active 
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Table 13-8.TMS320C31 Signal Descriptions (Continued) 



Signal/Port 


#Pins 


l/O/Zt 


Description 


Condition When 
Signal Is In High Z* 


Control Signals (10 Pins) 


INT3-INT0 


4 


1 


External interrupts 






IACK 


1 


o/z 


Interrupt acknowledge signal. IACK is set to 1 
by the IACK instruction. This can be used to in- 
dicate the beginning or end of an interrupt ser- 
vice routine. 


S 




MCBLVMP 


1 


1 


Microcomputer boot loader/microprocessor 
mode pin 






RESET 


1 


1 


Reset. When this pin is a logic low, the device is 
placed in the reset condition. When reset be- 
comes a logic 1 , execution begins from the loca- 
tion specified by the reset vector. 






OUT 


1 


1 

1 


Shut down high Z. An active (low) shuts down 
the TMS320C31 and places all pins in a high- 
impedance state. This signal is used for board- 
level testing to ensure that no dual drive condi- 
tions occur. CAUTION: An active (low) on the 
SHZ pin corrupts TMS320C31 memory and reg- 
ister contents. Reset the device with an SHZ = 1 
to restore it to a known operating condition. 






XF1.XF0 


2 


l/O/Z 


External flag pins. These are used as general- 
purpose I/O pins or to support interlocked pro- 
cessor instructions. 


S 


R 


Serial Port 0 Signals (6 Pins) 


CLKRO 




l/O/Z 


Serial port 0 receive clock. This pin serves as 
the serial shift clock for the serial port 0 receiver. 


S 


R 


CLKXO 




l/O/Z 


Serial port 0 transmit clock. Serves as the serial 
shift clock for the serial port 0 transmitter. 


S 


R 


DRO 




l/O/Z 


Data receive. Serial port 0 receives serial data 
via the DRO pin. 


s 


R 


DXO 




l/O/Z 


Data transmit output. Serial port 0 transmits se- 
rial data on this pin. 


s 


R 


FSRO 




l/O/Z 


Frame synchronization pulse for receive. The 
FSRO pulse initiates the receive data process 
over DRO. 


s 


R 



T Input (I), output (O), high-im pedance st ate (Z) 

* S = SHZ active, H = HOLD active, R = RESET active 
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Table 13-8.TMS320C31 Signal Descriptions (Continued) 



Signal/Port 


#Pins 


I/O/Zt 


Description 


Condition When 
Signal Is in High Z* 


Serial Port 0 Signals (6 Pins) (Continued) 


FSXO 


1 


l/O/Z 


Frame synchronization pulse for transmit. The 
FSXO pulse initiates the transmit data process 
over pin DXO. 


S R 


Timer Signals (2 Pins) 


TCLKO 


1 


l/O/Z 


Timer clock 0. As an input, TCLKO is used by 
timer 0 to count external pulses. As an output 
pin, TCLKO outputs pulses generated by timer 
0. 


S 


TCLK1 


1 


l/O/Z 


Timer clock 1 . As an input, TCLKO is used by 
timer 1 to count external pulses. As an output 

pin, iv/u\i uuipuio puioco ycncicucu uy uinci 
1. 


S 


Supply and Oscillator Signals (49 Pins) 


H1 


1 


o/z 


External H1 clock. This clock has a period 
equal to twice CLKIN. 


S 


H3 


1 


o/z 


External H3 clock. This clock has a period 
equal to twice CLKIN. 


S 


V D D 


20 


I 


+5-Vqc supply pins. All pins must be con- 
nected to a common supply plane. § 




v S s 


25 


I 


Ground pins. All ground pins must be con- 
nected to a common ground plane. 




X1 


1 


o/z 


Output pin from the internal crystal oscillator. If 
a crystal is not used, this pin should be left un- 
connected. 




X2/CLKIN 


1 


I 


The internal oscillator input pin from a crystal or 
a clock. 




Reserved (4 Pins) * 


EMU2-EMU0 


3 


I 


Reserved. Use 20-kQ pull-up resistors to +5 
volts. 




EMU3 


1 


0 


Reserved. 





t Input (I), output (O), high-im pedance st ate (Z) 

* S = SHZ active, H = HOLD active, R = RESET active 

§ The recommended decoupling capacitor value is 0.1 \iF. 

U Follow the connections specified for the reserved pins. 1 8- to 22-kQ pull-up resistors are recommended. All +5-volt supply pins 
must be connected to a common supply plane, and all ground pins must be connected to a common ground plane. 
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13.3 Electrical Specifications 

Table 13-9, Table 13-10, Table 13-11, and Figure 13-8 show the electrical 
specifications for the TMS320C3x. 



Table 13-9. Absolute Maximum Ratings Over Specified Temperature Range 



Condition/Characteristic 


'C30/'C31 Range 


'LC31 Range 


Supply voltage range, Vqd 


-0.3 V to 7 V 


-0.3 V to 5 V 


Input voltage range 


-0.3 V to 7 V 


-0.3 V to 5 V 


Output voltage range 


-0.3 V to 7 V 


-0.3 V to 5 V 


Continuous power dissipation (worst case) 


3.1 5 W for TMS320C30-33 
1.7WforTMS320C31-33 
(See Note 3) 


1.1 W 

(See Note 3) 


Operating case temperature range 


TMS320C30GEL 0 ° C to 85 °C 
TMS320C31PQL 0°Cto85°C 
TMS320C31PQA -40°C to +125°C 


0°Cto 85°C 


Storage temperature range 


-55 °C to150°C 


-55 °C to150°C 



Notes: 1 ) All voltage values are with respect to V33. 



2) Stresses beyond those listed above may cause permanent damage to the device. This is a stress rating only; 
functional operation of the device at these or any other conditions beyond those indicated in Table 1 3-1 0 is not im- 
plied. Exposure to absolute-maximum-rated conditions for extended periods may affect device reliability. 

3) Actual operating power will be less than stated. These values were obtained under specially produced worst-case 
test conditions, which are not sustained during normal device operation. These conditions consist of continuous 
parallel writes of a checkerboard pattern to both primary and expansion buses at the maximum rate possible. See 
nominal (Idd) current specification in Table 13-11 . 
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Table 13-10. Recommended Operating Conditions 







'C30/'C31 


'LC31-33 






rarameier 


MSn 


Norn 


Max 


Min 


Nom 


Max 


unii 


Vnn 


SuddIv voltaaes (DDVnn etc ) 


4.75 


5 


5.25 


3.13 


3.3 


3.47 


V 




SuddIv voltaaes (CVoo etc ^ 




0 






0 




V 


V IH 


Hinh-lpvpl innut voltanp 


2 




V D D + 
+ 0.3t 


1.8 




v DD + 

+ 0.3t 


v 


Vii 


Low-level input voltage 


-0.3 




0.8 


-0.3t 




0.6 


V 


•oh 


High-level output current 






-300 






-300 


ma 


lOL 


Low-level output current 






2 






2 


mA 


T 


Operating case temperature 
range 


0 




85 


0 




85 


OQ 


v TH 


CLKIN high-level input voltage 
for CLKIN 


2.6 


v D d + 

+ 0.3t 




2.5 


Vdd + 

+ 0.3t 




V 



t Guaranteed from characterization but not tested 



Note: All voltage values are with respect to Vss- All input and output voltages except those for CLKIN are TTL compatible. 
CLKIN can be driven by a CMOS clock. 
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Table 13-11. Electrical Characteristics Over Specified Free-Air Temperature Range* 



Electrical Characteristic 



'C30/'C31 



Min Norn* Max 



'LC31-33 



Min Norn* Max 



Unit 



Vqh High-level output voltage (Vdd = Min. 'oh = 
Max) 

Vql^ Low-level output voltage (Vdd = Min. 'ol = 
Max) 



2.4 



2.0 



"l 

"IP 

■cc 



Three-state current (Vdd = Max) 

Input current (V| = Vss to Vdd) 

Input current (Inputs with internal pull-ups) ' 



-20 
-10 
-400 



0.3 0.6 

20 
10 
20 



-20 
-10 
-400 



Supply current (T^ = 
25° C, V P d = Max,f x 
= Max) #ll 



'C30-33 
'C30-27 
'C30-40 
'C31-27 
'C31 -33 

'C31-33 (ext. temp) 
'031-40 
'031-50 
'C30 PPM 



Idd Supply current, standby; IDLE2, clocks shut 
off 

Cj Input capacitance All inputs except 

CLKIN 



200 
175 
170 
120 
150 
150 
160 
200 
170 

50 



CLKIN 



Output capacitance 



600 
500 
600 
260 
325 
325 
390 
425 
600 



15* 



25 
20* 



0.4 

20 
10 
10 

120 300 



21 



15* 



25 



MA 
\xA 
HA 

mA 



mA 
PF 



20* pF 



t All input and output voltage levels are TTL compatible. 

* All nominal values are at Vdd = 5 v » T A = 25°C. 

§ For 'C30 PPM: VoL.(max)=u-6 V, except for the following: 
Voi_(max)=1 VforA(0-31) 
VoL(max)=0.9 V for XA(0- 1 2), P(0-31) 
VQL(max)=0.7 V for STRB, XSTRB, MSTRB, FSX0/I, CLKX0/1 , 
CLKR0/1 , DX0/1 R/W, XR/W 

^ Pins with internal pull-up devices: INT3-INT0, MC/MP, RSV10-RSV0. Although RSV1 0-RSV0 have internal pullup devices, 
external pullups should be used on each pin as described in Table 13-7 beginning on page 13-17. 

* Actual operating current will be less than this maximum value. This value was obtained under specially produced worst-case 
test conditions, which are not sustained during normal device operation. These conditions consist of continuous parallel writes 
of a checkerboard pattern to both primary and expansion buses at the maximum rate possible. See Calculation ofTMS320C30 
Power Dissipation, Appendix D. 

II f x is the input clock frequency. The maximum value is 40 MHz. 

* Guaranteed by design but not tested 
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Where: Iol = 20 mA ( a " outputs) 
Iqh = 300 ( al1 outputs) 
VLoad = 2.15 V 



13-28 



Signal Transition Levels 



13.4 Signal Transition Levels 



13.4.1 TTL-Level Outputs 

TTL-compatible output levels are driven to a minimum logic-high level of 2.4 
volts and to a maximum logic-low level of 0.6 volt. Figure 1 3-9 shows the TTL- 
level outputs. 

Figure 13-9. TTL-Level Outputs 




2.4 V 
2.0 V 



1.0V 
0.6 V 



TTL-output transition times are specified as follows: 

□ For a high-to-low transition, the level at which the output is said to be no 
longer high is 2.0 volts, and the level at which the output is said to be low 
is 1 .0 volt. 

□ For a low-to-high transition, the level at which the output is said to be no 
longer low is 1 .0 volt, and the level at which the output is said to be high 
is 2.0 volts. 



13.4.2 TTL-Level Inputs 

Figure 13-10 shows the TTL-level inputs. 

Figure 13-10. TTL-Level Inputs 




2.0 V 
90% 



10% 
0.8 V 



TTL-compatible input transition times are specified as follows: 

□ For a high-to-low transition on an input signal, the level at which the input 
is said to be no longer high is 2.0 volts, and the level at which the input is 
said to be low is 0.8 volt. 

□ For a low-to-high transition on an input signal, the level at which the input 
is said to be no longer low is 0.8 volt, and the level at which the input is said 
to be high is 2.0 volts. 
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13.5 Timing 

Timing specifications apply to the TMS320C30 and TMS320C31 . 

13.5.1 X2/CLKIN, H1, and H3 Timing 

Table 1 3-1 2 defines the timing parameters for the X2/CLKIN, H1 , and H3 in- 
terface signals. The numbers shown in parentheses in Figure 13-11 and 
Figure 1 3-1 2 correspond with those in the No. column of Table 1 3-1 2. Refer 
to the RESET timing in Figure 1 3-23 on page 1 3-48 for CLKIN to H1/H3 delay 
specification. 



Table 13-12. Timing Parameters forX2/CLKIN, H1, and H3§ 









'030-27/ 
'C31-27 


'C30-33/ 
'C31-33/ 
'LC31 


'C30-40/ 
'C31-40 


'C31-50 




No. 


Name 


Description 


Min 


Max 


Min 


Max 


Min 


Max 


Min 


Max 


Unit 


(1) 




CLKIN fall time 




6* 




5* 




5t 




5* 


ns 


(2) 


MCIL) 


CLKIN low pulse 
duration 
tc(CI) = min 


14 




10 




9 




7 




ns 


(3) 


MCIH) 


CLKIN high pulse 
duration 

tc(Cl) = min 


14 




10 




9 




7 




ns 


(4) 


tr(CI) 


CLKIN rise time 




6* 




5* 




5* 




5* 


ns 


(5) 


tc(CI) 


CLKIN cycle time 


37 


303 


30 


303 


25 


303 


20 


303 


ns 


(6) 


tf(H) 


H1/H3fall time 




4 




3 




3 




3 


ns 


(7) 


tw(HL) 


H1/H3 low pulse 
duration 


P-6 




P-6 




P-5 




P-5 




ns 


(8) 


MHH) 


H1/H3 high pulse 
duration 


P-7 




P-7 




P-6 




P-6 




ns 


(9) 


tr(H) 


H1/H3 rise time 




5 




4 




3 




3 


ns 


(9.1) 


td(HL-HH) 


Delay from H1(H3) 
low to H3(H1) high 


ot 


6 


ot 


5 


ot 


4 


ot 


4 


ns 


(10) 


tc(H) 


H1/H3 cycle time 


74 


606 


60 


606 


50 


606 


40 


606 


ns 



t Guaranteed from characterization but not tested 
$ Guaranteed by design but not tested 
§ P = tc(CI) 
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Figure 1 3-11. Timing for X2/CLKIN 
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F/gure 13-12. Timing for H1/H3 



H1 



! i 



(9) 
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13.5.2 Memory Read/Write Timing 

Table 1 3-1 3 defines memory read/write timing parameters for (M)STRB. The 
numbers shown in parentheses in Figure 13-13 and Figure 13-14 corre- 
spond with those in the No. column of Table 1 3-1 3. 



13-32 



Timing 



Table 13-13. Timing Parameters for a Memory ( (M)STRB) = 0) Read/Write 









'C30-27 
•C31-27 


'C30-33 
'C31-33 
'LC31 


'C30-40 
'C31-40 


'C31-50 




No. 


Name 


Description 


Min 


Max 


Min 


Max 


Min 


Max 


Min 


Max 


Unit 


('\'\\ 


*d(H1L-(M)SL) 


rl I IUW IU ylvlyO 1 nD 

low delay 


0* 


13 


0* 


10 


0* 


6§ 


0* 


4 


nQ 

1 IO 


V*) 


td(H1L-(M)SH) 


n i low io ^ivijo i riD 
high delay 


0* 


13 


0* 


10 


0* 


6 


0* 


4 


lib 


\ 1 0. 1 ) 


^d(HIH-RWL) 


n i nign to n/vv low 
delay 


0* 


13 


0* 


10 


0* 


9 


0* 


7 


1 Id 


M** 9\ 
\ i o.c.) 


*d(H1H-XRWL) 


H1 hinh tn YR/W 
n i iiiyii iu An/ v v 

low delay 


0* 


19 


0* 


15 


0* 


13 






1 IO 


M4 i\ 


td(H1L-A) 


H1 low to A wflliH 

delay 


0* 


16 


0* 


14 


0* 


11 


0* 


9 


ns 


M4 2} 


l d(H1L-XA) 


H1 low to XA valid 
delay 


0* 


12 


0* 


10 


0* 


9 






ns 


MR 1\ 


t su(D)R 


L/ oclUp UolUlC n I 

low (read) 


18 




16 




14 




10 




no. 

1 IO 


M5 2^ 


*su(XD)R 


Vn QPtim hpfnrp H1 

low (read) 


21 




18 




16 








ns 


V id; 


*h((X)D)R 


(Y\D hnlH time after 

y/\yLy 1 IVJIU ill lie duel 

H1 low (read) 


0 




0 




0 




0 




ns 


M7 1\ 

(17.1) 


*su(RDY) 


rut setup ueiore 
H1 high 


10 




8 




8 




6 




No 


M7 2^ 


l su(XRDY) 


XRDY ^Ptun hpfnrp 
An u i oeiup uc i \J\ e 

H1 high 


11 




9 




9 








ns 




*h((X)RDY) 


(Y\RHV hnlH timp 
y/\ jnu 1 i luiu in lie 

after H1 high 


0 




0 




0 




0 




ns 


MQ\ 


td(H1H--(X)RWH) 


H1 hinh tn (Y\R/\FJ 
n i iiiy ii iu yAyri/vv 

high (write) delay 




13 




10 




9 




7 


1 IO 


(20) 


tv((X)D)W 


(X)D valid after H1 
low (write) 




25 




20 




17 




14 


ns 


(21) 


*h((X)D)W 


(X)D hold time after 
H1 high (write) 


0* 




0* 




0* 




0* 




ns 



* Guaranteed by design but not tested 

§ For 'C30 PPM, td(H1L-(M)SL) (max)=7ns 
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Table 13-13. Timing Parameters for a Memory ( (M)STRB) = 0) Read/Write (Continued) 









'C30-27 
'C31-27 


•C30-33 
'C31-33 
'LC31 


'C30-40 
'C31-40 


'C31-50 




No. 


Name 


Description 


Min Max 


Min Max 


Min Max 


Min Max 


Unit 


(22.1) 


td(H1H-A) 


H1 high to A valid 

On DaCK-lO-DaCK 

write cycles (write) 
delay 


23 


18 


15 


12 


ns 


(22.2) 


*d(H1H-XA) 


H1 high to XA valid 
on back-to-back 
write cycles (write) 
delay 


32 


25 


21 




ns 


(26) 


*d(A-(X)RDY) 


(X)RDY delay from 
A valid delay 


10t 


8t 


7t 


6 


ns 



t Guaranteed from characterization but not tested 

* Guaranteed by design but not tested 

§ For 'C30 PPM, td(H1L-(M)SL) (max)=7ns 



Figure 13-13. Timing for Memory ( (M)STRB = 0) Read 



H1 



in). 
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(X)R/W 



|<-» h-(12) 



X 
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-M K- (18) 



Note: (M)STRB will remain low during back-to-back read operations. 
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Figure 13-14. Timing for Memory ( (M)STRB = 0) Write 
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i i 

-W (13.1/13.2) 
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Table 13-14 defines memory read timing parameters for IOSTRB. The num- 
bers shown in parentheses in Figure 13-15 and Figure 13-16 correspond 
with those in the No. column of Table 13-14 and Table 13-15. 



Table 13-14. Timing Parameters for a Memory ( IOSTRB = 0) Read 









'030-27 


'C30-33 


'C30-40 




No. 


Name 


Description 


Min 


Max 


Min 


Max 


Min 


Max 


Unit 


(11.1) 


^(HIH-IOSL) 


H1 high to IOSTRB low delay 


ot 


13 


ot 


10 


ot 


9 


ns 


(12.1) 


td(H1H-IOSH) 


H1 high to IOSTRB high delay 


ot 


13 


ot 


10 


ot 


9 


ns 


(13.1) 


^(HIL-XRWH) 


H1 low to XR/W high delay 


ot 


13 


ot 


10 


ot 


9 


ns 


(14.3) 


tyhhL-XA) 


H1 low to XA valid delay 


ot 


13 


ot 


10 


0* 


9 


ns 


(15.3) 


t su(XD)R 


XD setup before H1 high 


19 




15 




13 




ns 


(16.1) 


th(XD)R 


XD hold time after H1 high 


0 




0 




0 




ns 


(17.3) 


tsu(XRDY) 


XRDY setup before H1 high 


11 




9 




9 




ns 


(18.1) 


th(XRDY) 


XRDY hold time after H1 high 


0 




0 




0 




ns 



t Guaranteed by design but not tested 
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Figure 13-15. Timing for Memory ( IOSTRB = 0) Read 
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Figure 13-16. Timing for Memory ( IOSTRB = 0) Write 
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Table 1 3-1 5 defines memory write timing parameters for IOSTRB. The num- 
bers shown in parentheses in Figure 13-15 and Figure 13-16 correspond 
with those in the No. column of Table 13-14 and Table 13-15. 



Table 13-15. Timing Parameters for a Memory ( IOSTRB = 0) Write 



No. 


Name 


Description 


'C30-27 


'C30-33 


'030-40 


Unit 


Min 


Max 


Min 


Max 


Min 


Max 


(23) 


^(HIL-XRWL) 


H1 low to XR/W low delay 


ot 


19 


ot 


15 


ot 


13 


ns 


(24) 


Mxdjw 


XD valid after H1 high 




38 




30 




25 


ns 


(25) 


th(XD)W 


XD hold time after H1 low 


0 




0 




0 




ns 



t Guaranteed by design but not tested 



TMS320C3x Signal Descriptions and Electrical Characteristics 



13-37 



Timing 



13.5.3 XFO and XF1 Timing When Executing LDFI or LDII 

Table 1 3-1 6 defines the timing parameters for XFO and XF1 during execution 
of LDFI or LDII. The numbers shown in parentheses in Figure 13-17 corre- 
spond with those in the No. column of Table 13-16. 
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Table 13-16. Timing Parameters forXFO andXFI When Executing LDFI or LDII 









'C30-27 
'C31-27 


'C30-33 
'C31-33 
'LC31 


'C30-40 
'C31-40 


'C31-50 




No. 


Name 


Description 


Min Max 


Min Max 


Min Max 


Min 


Max 


Unit 


0) 


^(HSH-XFOL) 


H3 high to XFO low delay 


19 


15 


13 




12 


ns 


(2) 


tsu(XF1) 


XF1 setup before H1 low 


13 


10 


9 


9 




ns 


(3) 


th(XF1) 


XF1 hold time after H1 low 


0 


0 


0 


0 




ns 



Figure 13-17. Timing for XFO and XF1 When Executing LDFI or LDII 

Fetch 

| LDFI or LDII | Decode | Read | Execute | 




XFO Pin 



XF1 Pin 
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13.5.4 XFO Timing When Executing STFI and STII 

Table 13-17 defines the timing parameters for the XFO and XF1 pins during 
execution of STFI or STII. The number shown in parentheses in Figure 1 3-1 8 
corresponds with the number in the No. column of Table 13-17. 



Table 13-17. Timing Parameters for XFO When Executing STFI or STII 



No. Name Description 


'C30-27 
'C31-27 


'C30-33 
'C31-33 
'LC31 


'C30-40 
'C31-40 


'C31-50 


Unit 


Min Max 


Min Max 


Min Max 


Min Max 


0 ) td(H3H-XF0H) H 3 high to XFO high delay 


19 


15 


13 


12 


ns 



XFO is always set high at the beginning of the execute phase of the interlock 
store instruction. When no pipeline conflicts occur, the address of the store is 
also driven at the beginning of the execute phase of the interlock store instruc- 
tion. However, if a pipeline conflict prevents the store from executing, the ad- 
dress of the store will not be driven until the store can execute. 



Figure 13-18. Timing for XFO When Executing an STFI or STII 

Fetch 
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13.5.5 XFO and XF1 Timing When Executing SIGI 

Table 13-18 defines the timing parameters for the XFO and XF1 pins during 
execution of SIGI. The numbers shown in parentheses in Figure 1 3-1 9 corre- 
spond with those in the No. column of Table 1 3-1 8. 



Table 13-18. Timing Parameters for XFO andXFI When Executing SIGI 









'C30-27 
'C31-27 


'C30-33 
'C31-33 
'LC31 


'C30-40 
'C31-40 


'C31-50 




No. 


Name 


Description 


Min Max 


Min Max 


Min Max 


Min 


Max 


Unit 


0) 


^(HSH-XFOL) 


H3 high to XFO low delay 


19 


15 


13 




12 


ns 


(2) 


^(HSH-XFOH) 


H3 high to XFO high delay 


19 


15 


13 




12 


ns 


(3) 


tsu(XF1) 


XF1 setup before H1 low 


13 


10 


9 


9 




ns 


(4) 


VXF1) 


XF1 hold time after H1 low 


0 


0 


0 


0 




ns 



Figure 13-19. Timing for XFO andXFI When Executing SIGI 

Fetch 




— rfW- (4 

3U£ 
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13.5.6 Loading When the XF Pin Is Configured as an Output 

Table 1 3-1 9 defines the timing parameter for loading the XF register when the 
XF pin is configured as an output. The number shown in parentheses in 
Figure 1 3-20 corresponds with the number in the No. column of Table 1 3-1 9. 



Table 13-19. Timing Parameters for Loading the XF Register When Configured as an Output 
Pin 



No. Name Description 


'030-27 
'031-27 


'030-33 
'031-33 
'LC31 


'030-40 
'031-40 


'031-50 


Unit 


Min Max 


Min Max 


Min Max 


Min Max 


0 ) MHSH-XF) H3 h '9 h t0 XF va,id 


19 


15 


13 


12 


ns 



Figure 13-20. Timing for Loading XF Register When Configured as an Output Pin 

Fetch Load 

| Instruction | Decode | Read | Execute | 
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13.5.7 Changing the XF Pin From an Output to an Input 

Table 13-20 defines the timing parameters for changing the XF pin from an 
output pin to an input pin. The numbers shown in parentheses in Figure 1 3-21 
correspond with those in the No. column of Table 13-20. 



Table 13-20. Timing Parameters ofXF Changing From Output to Input Mode 









'C30-27 
'C31-27 


'C30-33 
'C31-33 
'LC31 


'C30-40 
'C31-40 


'C31-50 




No. 


Name 


Description 


Min Max 


Min Max 


Min Max 


Min 


Max 


Unit 


(1) 


*h(H3H-XF01) 


XF hold after H3 high 


19 


15 


13t 




12 


ns 


(2) 


tsu(XF) 


XF setup before H1 low 


13 


10 


9 


9 




ns 


(3) 


th(XF) 


XF hold after H1 low 


0 


0 


0 


0 




ns 



t For 'C30 PPM, t n (H3H-XF01) (max)=14ns 

Figure 13-21. Timing for Change ofXF From Output to Input Mode 



Execute 
Load of IOF 



Buffers Go 
From Output 
to Output 
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Delay 
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13.5.8 Changing the XF Pin From an Input to an Output 

Table 1 3-21 defines the timing parameter for changing the XF pin from an in- 
put pin to an output pin. The number shown in parentheses in Figure 13-22 
corresponds with the number in the No. column of Table 13-21 . 



Table 13-21. Timing Parameters ofXF Changing From Input to Output Mode 









'030-27 
'031-27 


'030-33 
'031-33 
'LC31 


'030-40 
'031-40 


'031-50 




No. 


Name 


Description 


Min Max 


Min Max 


Min Max 


Min Max 


Unit 


0) 


^(HSH-XFIO) 


H3 high to XF switching 
from input to output delay 


25 


20 


17 


17 


ns 



Figure 13-22. Timing for Change ofXF From Input to Output Mode 

i Execution of , , 

| LoadoflOF | | 

I r 
I 



IOXF 
Bit 



*-* (1) 



XF Pin 
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13.5.9 Reset Timing 

RESET is an asynchronous input that can be asserted at any time during a 
clock cycle. If the specified timings are met, the exact sequence shown in 
Figure 13-23 on page 13-48 will occur; otherwise, an additional delay of one 
clock cycle is possible. 

The asynchronous reset signals include XFO/1, CLKXO/1, DXO/1, FSXO/1, 
CLKRO/1 , DRO/1 , FSRO/1 , and TCLKO/1 . 

Ta ble 13-2 2 ('C30) and Table 13-23 ('C31) define the timing parameters for 
the RESET signal. The numbers shown in parentheses in Figure 1 3-23 corre- 
spond with those in the No. column of Table 13-22 or Table 13-23. 

Resetting the device initializes the primary and expansion bus control regis- 
ters to seven software wait states and therefore results in slow external ac- 
cesses until these registers are initialized. 

Note also that HOLD is an asynchronous input and can be asserted during 
reset. 
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Table 13-22. Timing Parameters for RESET for the TMS320C30 









'C30-27 


'C30-33 


'C30-40 




No. 


Name 


Description 


Min 


Max 


Min 


Max 


Min 


Max 


Unit 


(1) 


tsu(RESET) 


Setup for RESET before 
CLKIN low 


28 


pT9 


10 


P« 


10 


PT9 


ns 


(2.1) 


*d(CLKINH-H1H) 


CLKIN high to H1 high delay* 


6 


20 


4 


14 


2 


12 


ns 


(2.2) 


*d(CLKINH-H1L) 


CLKIN high to H1 low delay* 


6 


20 


4 


14 


2 


12 


ns 


(3) 


*su(RESETH-H1L) 


Setup for RESET high 
before H1 low and after 10 H1 
clock cycles 


13 




10 




9 




ns 


(5.1) 


*d(CLKINH-H3L) 


CLKIN high to H3 low delay* 


6 


20 


4 


14 


2 


12 


ns 


(5.2) 


*d(CLKINH-H3H) 


CLKIN high to H3 high delay* 


6 


20 


4 


14 


2 


12 


ns 


(8) 


tdis(H1H-(X)D) 


H1 high to (X)D disabled (high 
impedance) 




19t 




15' 




13T 


ns 


(9) 


tdis(H3H-(X)A) 


H3 high to (X)A disabled (high 
impedance) 




13t 




10t 




9t 


ns 


(10) 


*d (H3H-CONTROLH) 


H3 high to control signals high 
delay 




•tot 

13 T 




10 T 




9' 


ns 


(11) 


td(HIH-RWH) 


H1 high to R/W high delay 




13* 




10* 




9* 


ns 


(13) 


*d(H1H-IACKH) 


H1 high to IACK high delay 




13* 




10* 




9* 


ns 


(14) 


^dis(RESETL-ASYNCH) 


RESET low to asynchronous- 
ly reset signals disabled (high 
impedance) 




31* 




25* 




21* 


ns 



* Characterized but not tested 

* See Figure 1 3-24 for temperature dependence for the 33-MHz TMS320C30. See Figure 1 3-25 for temperature dependence 
for the 40-MHz TMS320C30. 



P-tc(CI) 
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Table 13-23. Timing Parameters for RESET for the TMS320C31 









'C31-27 


'C31-33 
'LC31 


'C31-40 


'C31-50 




No. 


Name 


Description 


Min 


Max 


Min 


Max 


Min 


Max 


Min 


Max 


Unit 


(1) 


^su (RESET) 


Setup for RESET 
before CLKIN low 


28 


ptfl 


10 


ptH 


10 


ptH 


10 


ptl 


ns 


(2.1) 


td(CLKINH-HIH) 


CLKINhightoHl 
high delay § # 


2 


12 


2 


12* 


2 


12 


2 


10 


ns 


(2-2) 


td(CLKINH-HIL) 


CLKIN high toH1 
low delay § # 


2 


12 


2 


12* 


2 


12 


2 


10 


ns 


(3) 


*su(RESETH-H1L) 


Setup for RESET 
high before H1 
low and after 10 
H1 clock cycles 


13 




10 




9 




7 




ns 


(5.1) 


K Q (OLKJ N H— H3L) 


CLKIN high to H3 
low delay § # 


2 


12 


2 


12+ 


2 


12 


2 


10 


ns 


(5.2) 


*d (CLKIN H-H3H) 


CLKIN high to H3 
high delay § # 


2 


12 


2 


12* 


2 


12 


2 


10 


ns 


(8) 


tdisfHIH-PQD) 


H1 high to D 
disabled (high 
impedance) 




19t 




15* 




13* 




12* 


ns 


(9) 


t dis(H3H-(X)A) 


H3 high to A 
disabled (high 
impedance) 




13t 




10t 




9t 




8* 


ns 


(10) 


u^non— v/v-mn i nvjLny 


H3 high to 
control signals 
high delay 




13t 




10* 




9* 




8* 


ns 


(12) 


^d(HIH-RWH) 


H1 high to R/W 
high delay 




13t 




10* 




9* 




8* 


ns 


(13) 


td(HIH-IACKH) 


H1 high to IACK 
high delay 




13t 




10* 




9* 




8* 


ns 


(14) 


*dis(RESETL-ASYNCH) 


RESET low to 
asynchronously 
reset signals dis- 
abled (high im- 
pedance) 




3lt 




25* 




21* 




17* 


ns 



* Characterized but not tested 

* 1 4 ns for the extended temperature 'C31 -33 

§See Figure 13-25 for temperature dependence for the TMS320C31-27, TMS320C31-33, and the extended-temperature 
TMS320C31-33. 



;P-tc(C0 

ff See Figure 1 3-26 for temperature dependence for the TMS320C31 -50. 
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Figure 13-23. Timing for RESET 

CLK,N AAAAAAAAA/^^AAAAAAAA 

-*t w-0) I 
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Notes: 1) (X)D Includes D31-D0 and XD31-XD0. 
2) (X)A Includes A23-A0 and XA12-XA0. 



3) Control signals Include STRB, MSTRB, and IOSTRB. 

4) Asynchronously reset signals include XFO/1 , CLKXO/1 , DXO/1 , FSXO/1 , CLKRO/1 , DRO/1 , FSRO/1 , and TCLKO/1 . 

5) RESET is an asynchronous input and can be asserted at any point during a clock cycle. If the specified timings are 
met, the exact sequence shown will occur; otherwise, an additional delay of one clock cycle is possible. 

6) Note that the R/W and XR/W outputs are placed in a high-impedance state during reset and can be provided with 
a resistive pull-up, nominally 1 8-22 kQ, if undesirable spurious writes could be caused when these outputs go low. 

7) In microprocessor mode, the reset vector is fetched twice, with seven software wait states each time. In microcom- 
puter mode, the reset vector is fetched twice, with no software wait states. 
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Figure 13-24. CLKIN to H1/H3 as a Function of Temperature 
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Figure 13-25. CLKIN to H1/H3 as a Function of Temperature 
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Figure 13-26. CLKIN to H1/H3 as a Function of Temperature 
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13.5.10 SHZ Pin Timing 

Table 13-24 defines the timing parameters for the SHZ pin. The numbers 
shown in parentheses in Figure 13-27 correspond with those in the No. col- 
umn of Table 13-24. 



Table 13-24. Timing Parameters for the SHZ Pin 









'C30 
'C31 
'LC31 




No. 


Name 


Description 


Min Max 


Unit 


0) 


tdis(SHZ) 


SHZ low to all 0, I/O pins disabled 
(high impedance) 


0t 2Pt* 


ns 


(2) 


ten(SHZ) 


SHZ high to all 0, I/O pins enabled 
(active) 


Ot 2Pt* 


ns 



t Characterized but not tested 



*P-tc(CI) 



Figure 13-27. Timing for SHZ Pin 
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Note: Enabling SHZ destroys TMS320C3x register and memory contents. Assert SHZ = 1 and reset the TMS320C3x to restore 
it to a known condition. 
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1 3.5.1 1 Interrupt Response Timing 

Table 1 3-25 defines the timing parameters for the INT signals. The numbers 
shown in parentheses in Figure 13-28 correspond with those in the No. col- 
umn of Table 13-25. 



Table 13-25. Timing Parameters for INT3-INT0 









'C30-27 
'C31-27 


'C30-33 
'C31-33 
'LC31 


'C30-40 
'C31-40 


'C31-50 




No. 


Name 


Description 


Min Max 


Min Max 


Min Max 


Min Max 


Unit 


0) 


tsu(INT) 


INT3-INT0 setup before H1 
low 


19 


15 


13 


10 


ns 


(2) 


Mint) 


Interrupt pulse duration to 
guarantee only one interrupt 


P 2Ptt 


P 2Ptt 


P 2Pt* 


P 2Pt* 


ns 



t Characterized but not tested 



*P = t c(H ) 

The interrupt (TNT) pins are asynchronous inputs that can be asserted at any 
time during a clock cycle. The TMS320C3x interrupts are level-sensitive, not 
edge-sensitive. Interrupts are detected on the falling edge of H1 . Therefore, 
interrupts must be set up and held to the falling edge of H 1 for proper detection. 
The CPU and DMA respond to detected interrupts on instruction fetch bound- 
aries only. 

For the processor to recognize only one interrupt on a given input, an interrupt 
pulse must be set up and held to: 

□ A minimum of one H1 falling edge, and 

□ No more than two H1 falling edges. 

The TMS320C3x can accept an interrupt from the same source every two H1 
clock cycles. 

If the specified timings are met, the exact sequence shown in Figure 1 3-28 will 
occur; otherwise, an additional delay of one clock cycle is possible. 
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Figure 13-28. Timing for INT3-INT0 Response 
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1 3.5.1 2 Interrupt Acknowledge Timing 

The IACK output goes active on the first half-cycle (HI rising) of the decode 
phase of the IACK instruction and goes inactive at the first half-cycle (HI rising) 
of the read phase of the IACK instruction. 

Table 1 3-26 defines the timing parameters for the IACK signal. The numbers 
shown in parentheses in Figure 13-29 correspond with those in the No. col- 
umn of Table 13-26. 



Table 13-26. Timing Parameters for IACK 









'C30-27 
'C31-27 


'C30-33 
'C31-33 
'LC31 


'C30-40 
'C31-40 


'C31-50 




No. 


Name 


Description 


Min Max 


Min Max 


Min Max 


Min Max 


Unit 


(1) 


*d(H1H-IACKL) 


Hi high to iACK iow delay 


* r\ 
IO 


1 V 


n 

o 




ns 


(2) 


td(HIH-IACKH) 


H1 high to IACK high delay 


13 


10 


9 


7 


ns 



Note: The IACK output is active for the entire duration of the bus cycle and is therefore extended if the bus cycle utilizes wait 
states. 



Figure 13-29. Timing for IACK 
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1 3.5.1 3 Data Rate Timing Modes 



Unless otherwise indicated, the data rate timings shown in Figure 13-30 and 
Figure 13-31 are valid for all serial port modes, including handshake. For a 
functional description of serial port operation, refer to subsection 8.2.12 on 
page 8-30. 

Table 1 3-27 defines the serial port timing parameters for eight 'C3x devices. 
The numbers shown in parentheses in Figure 13-30 and Figure 13-31 corre- 
spond with those in the No. column of Table 13-27. 



Figure 13-30. Timing for Fixed Data Rate Mode 
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Notes: 1) Timing diagrams show operations with CLKXP = CLKRP = FSXP = FSRP = 0. 

2) Timing diagrams depend on the length of the serial port word, where n = 8, 16, 24, or 32 bits, respectively. 
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Figure 13-31. Timing for Variable Data Rate Mode 




Notes: 1) Timing diagrams show operation with CLKXP = CLKRP = FSXP = FSRP = 0. 

2) Timing diagrams depend on the length of the serial port word, where n = 8, 16, 24, or 32 bits, respectively. 

3) The timings that are not specified expressly for the variable data rate mode are the same as those that are specified 
for the fixed data rate mode. 
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Table 13-27. Serial-Port Timing Parameters 











TMS320C30-27/TMS320C31 -27 




No. 


Name 


Description 




Min 


Max 


Unit 


(1) 


tym-sci^ 


H1 high to internal CLKX/R 


delay 




19 


ns 


(2) 


tc(SCK) 


CLKX/R cycle time 


CLKX/R ext 
CLKX/R int 


tc(H)X2 


tc(H)*2 32 * 


ns 


(3) 


tw(SCK) 


CLKX/R high/low pulse 
duration 


CLKX/R ext 
CLKX/R int 


tc(H)+12t 
[tc(SCK)/2]-15 


Pc(SCK)/2]+5 


ns 


(4) 


V(schg 


CLKX/R rise time 






10t 


ns 


(5) 


tffcnio 


CLKX/R fall time 






10t 


ns 


(6) 


*d(DX) 


CLKXto DX valid delay 


CLKX ext 
CLKX int 




44 
25 


ns 


(7) 


tsu(DR) 


DR setup before 
CLKR low 


CLKR ext 
CLKR int 


13 
31 




ns 


(8) 


*h(DR) 


DR hold from 
CLKR low 


CLKR ext 
CLKR int 


13 
0 




ns 


(9) 


*d(FSX) 


CLKXto internal 
FSX high/low delay 


CLKX ext 
CLKX int 




40 
21 


ns 


(10) 


*su(FSR) 


FSR setup before CLKR 
low 


CLKR ext 
CLKR int 


13 
13 




ns 


(11) 


th(FS) 


FSX/R input hold from 
CLKX/R low 


CLKX/R ext 
CLKX/R int 


13 
0 




ns 


(12) 


*su(FSX) 


External FSX setup be- 
fore CLKX 


CLKX ext 
CLKX int 


-[tc(H)-8] 
4t C (H)-21] 


[tc(SCK)/2M0* 
*c(SCK)/2* 


ns 


(13) 


*d(CH-DX)V 


CLKXto first DX bit, FSX 
precedes CLKX high 
delay 


CLKX ext 




45 


ns 






CLKX int 




26 




(14) 


td(FSX-DX)V 


FSX to first DX bit, CLKX precedes FSX 
delay 




45 


ns 


(15) 


td(DXZ) 


CLKX high to DX high impedance following 
last data bit delay 




25t 


ns 



t Guaranteed by design but not tested 
* Not tested 
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Table 13-27. Serial-Port Timing Parameters (Continued) 











TMS320C30-33/TMS320C31 -33/ 
TMS320LC31 




No. 


Name 


Description 




Min 


Max 


Unit 


0) 


tym-scK) 


H1 high to internal CLKX/R delay 




15 


ns 


(2) 


tySCK) 


CLKX/R cycle time 


CLKX/R ext 
CLKX/R int 


Wunx2.6l" 
*c(H)X2 


tc(H)*232* 


ns 


(3) 


MSCK) 


CLKX/R high/low pulse 
duration 


CLKX/R ext 
CLKX/R int 


rU/e^u^/21-1 5 


rt/s/o^i^\/21+5 


ns 


(4) 


tr(SCK) 


CLKX/R rise time 






8t 


ns 


(5) 


tf(SCK) 


CLKX/R fall time 






8t 


ns 


(6) 


^d(DX) 


CLKX to DX valid delay 


CLKX ext 
CLKX int 




35 
20 


ns 


(7) 


*su(DR) 


DR setup before 
CLKR low 


CLKR ext 
CLKR int 


10 
25 




ns 


(8) 


^(DR) 


DR hold from 
CLKR low 


CLKR ext 
CLKR int 


10 
0 




ns 


(9) 


*d(FSX) 


olj\a to internal 
FSX high/low delay 


OLrvX ext 
CLKX int 




32 
17 


ns 


(10) 


tsu(FSR) 


FSR setup before 
CLKR low 


CLKR ext 
CLKR int 


10 
10 




ns 


(11) 


*h(FS) 


FSX/R input hold from 
CLKX/R low 


CLKX/R ext 
CLKX/R int 


10 
0 




ns 


(12) 


teiifFQYN 


External FSX setup be- 
fore CLKX 


CLKX ext 
CLKX int 


~Pc(H)-8] 
[tc(Hr21] 


[tc(SCK)/2H0* 
tySCK)/ 2 * 


ns 


(13) 


^(CH-DXJV 


CLKX to first DX bit, 
FSX precedes 
CLKX high delay 


CLKX ext 
CLKX int 




36 
21 


ns 


(14) 


^(FSX-DXJV 


FSX to first DX bit, CLKX precedes FSX 
delay 




36 


ns 


(15) 


td(DXZ) 


CLKX high to DX high impedance follow- 
ing last data bit delay 




20t 


ns 



t Guaranteed by design but not tested 
*Not tested 
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Table 13-27. Serial-Port Timing Parameters (Continued) 











TMS320C30-40/TMS320C31 -40 




No. 


Name 


Description 




Min 


Max 


Unit 


(1) 




H1 high to internal CLKX/R delay 




13 


ns 


(2) 


tc(SCK) 


CLKX/R cycle time 


CLKX/R ext 
CLKX/R int 


tc (H )x2.6t 
*c(H)><2 


tc(H)*232* 


ns 


(3) 


tw(SCK) 


CLKX/R high/low pulse 
duration 


CLKX/R ext 
CLKX/R int 


t c ( H )+10t 


[tc(SCK)/2]+5 


ns 


(4) 


tr(SCK) 


CLKX/R rise time 






7t 


ns 


(5) 


tf(SCK) 


CLKX/R fall time 






7t 


ns 


(6) 


td(DX) 


CLKX to DX valid delay 


CLKX ext 
CLKX int 




30 
17 


ns 


(7) 


tsu(DR) 


DR setup before 
CLKR low 


CLKR ext 
CLKR int 


9 
21 




ns 


(8) 


th/nR\ 

\\\\Jv\) 


DR hold from 
CLKR low 


CLKR ext 
CLKR int 


9 
0 




ns 


(9) 


td(FSX) 


CLKX to internal 
FSX high/low delay 


CLKX ext 
CLKX int 




27 
15 


ns 


(10) 


*su(FSR) 


FSR setup before 
CLKR low 


CLKR ext 
CLKR int 


9 

Q 




ns 


fin 


*h(FS) 


FSX/R input hold from 
CLKX/R low 


CLKX/R ext 
CLKX/R int 


9 
0 




1 IO 


(12) 


l SU(roA; 


External FSX setup be- 
fore CLKX 


CLKX ext 
CLKX int 




VI VVI XI 


ns 


(13) 


*d(CH-DX)V 


CLKX to first DX bit, FSX 
precedes CLKX high 
delay 


CLKX ext 
CLKX int 




30 
18 


ns 


(14) 


td(FSX-DX)V 


FSX to first DX bit, CLKX precedes FSX 
delay 




30 


ns 


(15) 


td(DXZ) 


CLKX high to DX high impedance following last 
data bit delay 




17t 


ns 



t Guaranteed by design but not tested 
* Not tested 



TMS320C3x Signal Descriptions and Electrical Characteristics 



13-59 



Timing 



Table 13-27. Serial-Port Timing Parameters (Continued) 



No. 


Name 


Description 




TMS320C31-50 


Unit 


Min 


Max 


(1) 


tym-scK) 


H1 high to internal CLKX/R delay 




10 


ns 


(2) 


tc(SCK) 


CLKX/R cycle time 


CLKX/R ext 


t~/u\ x 2.6t 




ns 






CLKX/R int 


Wl-h x 2 




(3) 


t w (SCK) 


CLKX/R high/low pulse dura- 


CLKX/R ext 


l C(H) r 1 u 




ns 




tion 


CLKX/R int 


rWqrio/21-5 




(4) 


t r (sCK) 


CLKX/R rise time 






6t 


ns 


(5) 


tf(SCK) 


CLKX/R fall time 






6 T 


ns 


( 6 ) 


*d(DX) 


CLKXto DX valid delay 


CLKX ext 




24 


ns 






CLKX int 




16 




("7\ 


*su(DR) 


DR setup before CLKR low 


CLKR ext 


9 




ns 






CLKR int 


17 








th(DR) 


DR hold from CLKR low 


CLKR ext 


7 




ns 






CLKR int 


0 








*d(FSX) 


CLKX to internal FSX high/ 


CLKX ext 




22 


ns 




low delay 


CLKX int 








MO) 


I . /r~ r>v 

l su(FSR) 


FSR setup before CLKR low 


CLKR ext 


7 




1 1© 






CLKR int 


7 






/■m \ 
(■•) 


*h(FS) 


FSX/R input hold from 


CLKX/R ext 


7 




ns 




CLKX/R low 


CLKX/R int 


0 






(12) 


*su(FSX) 


External FSX setup before 


CLKX ext 


1! c(H) il 


[t C (SCK)/2]-10* 


ns 




CLKX 


CLKX int 


-Bc(H)-21]. 


tySCK)/ 2 * 




(13) 


*d(CH-DX)V 


CLKX to first DX bit, FSX pre- 


CLKX ext 




24 


ns 


cedes CLKX high delay 


CLKX int 




14 




(14) 


*d(FSX-DX)V 


FSX to first DX bit, CLKX precedes FSX 
delay 




24 


ns 


(15) 


*d(DXZ) 


CLKX high to DX high impedance following 




14t 


ns 




last data bit delay 











t Assured by design but not tested 
* Not tested 
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13.5.14 HOLD Timing 



HOLD is an asynchronous input that can be asserted at any time during a clock 
cycle. If the specified timings are met, the exact sequence shown in 
Figure 13-32 will occur; otherwise, an additional delay of one clock cycle is 
possible. 



Table 1 3-28 defines the timing parameters for the HOLD and HOLDA signals. 
The numbers shown in parentheses in Figure 1 3-32 correspond with those in 
the No. column of Table 13-28. 

The NOHOLD bit of the primary bus control register (see subsection 7.1 .1 on 
page 7-3) overrides the HOLD signal. When this bit is set, the device comes 
out of hold and prevents future hold cycles. 



Asserting HOLD prevents the processor from accessing the primary bus. Pro- 
gram execution continues until a read from or a write to the primary bus is re- 
quested. In certain circumstances, the first write will be pending, thus allowing 
the processor to continue until a second write is encountered. 



Figure 13-32. Timing for HOLD/HOLDA 




Note: HOLDA will go low in response to HOLD going low and will continue to remain low until one H1 cycle after HOLD goes 
back high, as shown in Figure 13-32. 
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Table 13-28. Timing Parameters for HOLD/HOLDA 









'C30-27 
'C31-27 


'C30-33 
'C31-33 
'LC31 


'C30-40 
'C31-40 


•C31-50 




No. 


Name 


Description 


Min 


Max 


Min 


Max 


Min 


Max 


Min 


Max 


Unit 


(1) 


tsu(HOLD) 


HOLD setup 
before H1 low 


19 




15 




13 




10 




ns 


(3) 


VflHOLDA) 


HOLDA valid 
after H1 low 


0* 


14 


0* 


10 


0* 


9 


0* 


7 


ns 


(4) 


*w(HOLD^) 


HOLD low du- 
ration 


2tc(H) 




2tc(H) 




2tc(H) 




2tc(H) 




ns 


(6) 


tw(HOLDA) 


HOLDA low du- 
ration 


tcH-5* 




tcH^ 




tcH-5+ 




tcH"5t 




ns 


/"7\ 
(') 


td(HIL-SH)H) 


n 1 IOW tO 
STRB high for 
a HOLD delay 


0+ 


13 


0* 


10 


0* 


9 


0* 


-7 

7 


ns 




kjisfrllL-S) 


H1 low to 

ill ivyvv iw 

STRB disabled 
(high-impe- 
dance state) 


0* 


13t 


0* 


10t 


0* 


9t 


0* 


8t 


ns 


(9) 


tsm/l_H 1 0\ 

^(rllL— o) 


H1 low to 
STRB enabled 
(active) 


0* 


13 


0* 


10 


0* 


g 


0* 


7 


ns 


no) 


kJis(HIL-RW) 


H1 low to R/W 

III IV/ VV \\J 11/11 

disabled (high- 
impedance 
state) 


0* 


13t 


0* 


10t 


0* 


9t 


0* 


8t 


ns 


on 


/I I ^ ■ P")\ » A 

^n(HIL--HW) 


H1 low to R/W 

III Ivff \\J 1 1/ V ¥ 

enabled (ac- 
tive) 




13 


0* 


10 


0* 


g 


0* 


7 


ns 


v 1 


T /II j ■ ax 

L-A) 


H1 low to ad- 

III IwVf lw 

dress disabled 
(high-impe- 
dance state) 


0* 


13t 


0* 


10t 


0* 




0* 


8t 


ns 


(13) 


ton^HII — A^ 


H1 low to ad- 
dress enabled 
(valid) 


0* 


19 


0* 


15 


0* 


13 


0* 


12 


ns 


(16) 


^dis(HIH-D) 


H1 high to data 
disabled (high- 
impedance 
state) 


0* 


13t 


0* 


10t 


0* 


9t 


0* 


8t 


ns 



t Characterized but not tested 
* Not tes ted 

§ HOLD is an asynchronous input and can be asserted at any point during a clock cycle. If the specified timings are met, the exact 
sequence shown will occur; otherwise, an additional delay of one clock cycle is possible. 
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1 3.5.1 5 General-Purpose I/O Timing 

Peripheral pins include CLKXO/1 , CLKRO/1 , DXO/1 , DRO/1 , FSXO/1 , FSRO/1 , 
and TCLKO/1 . The contents of the internal control registers associated with 
each peripheral define the modes for these pins. 

1 3.5. 15. 1 Peripheral Pin I/O Timing 

Table 13-29 defines peripheral pin general-purpose I/O timing parameters. 
The numbers shown in parentheses in Figure 1 3-33 correspond with those in 
the No. column of Table 13-29. 



Table 13-29. Timing Parameters for Peripheral Pin General-Purpose I/O 









'030-27 
'C31-27 


'C30-33 
'C31-33 
'LC31 


'C30-40 
'C31-40 


'C31-50 




No. 


Name 


Description 


Min Max 


Min Max 


Min Max 


Min Max 


Unit 


(1) 


tsuCGPIOHIL) 


General-purpose input setup 
before H1 low 


15 


12 


10 


9 


ns 


(2) 


th(GPIOHIL) 


General-purpose input hold 
time after H1 low 


0 


0 


0 


0 


ns 


(3) 


^(GPIOHIH) 


General-purpose output 
delay after H1 high 


19 


15 


13 


10 


ns 



Note: Peripheral pins include CLKXO/1 , CLKRO/1 , DXO/1 , DRO/1 , FSXO/1 , FSRO/1 , and TCLKO/1 . The modes of these pins are 
defined by the contents of internal control registers associated with each peripheral. 



Figure 13-33. Timing for Peripheral Pin General-Purpose I/O 




13.5. 15.2 Changing the Peripheral Pin I/O Modes 

Table 13-30 and Table 13-31 show the timing parameters for changing the 
peripheral pin from a general-purpose output pin to a general-purpose input 
pin and vice versa. The numbers shown in parentheses in Figure 13-34 and 
Figure 13-35 correspond to those shown in the No. column of Table 13-30 
and Table 13-31, respectively. 
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Table 13-30. Timing Parameters for Peripheral Pin Changing From General-Purpose Output 
to Input Mode 









'C30-27 
'C31-27 


'C30-33 
•C31-33 
'LC31 


'C30-40 
'C31-40 


'C31-50 




No. 


Name 


Description 


Min Max 


Min Max 


Min Max 


Min 


Max 


Unit 


0) 


th(H3H) 


Hold after H1 high 


19 


15 


13 




10 


ns 


(2) 


tsu(GPIOH1L 


Peripheral pin setup before 
H1 low 


13 


10 


9 


9 




ns 


(3) 


^(GPIOHIL 


Peripheral pin hold after H1 
low 


0 


0 


0 


0 




ns 



Table 13-31. Timing Parameters for Peripheral Pin Changing From General-Purpose Input to 









'C30-27 
'C31-27 


'C30-33 
'C31-33 
'LC31 


'C30-40 
'C31-40 


'C31-50 




No. 


Name 


Description 


Min Max 


Min Max 


Min Max 


Min Max 


Unit 


(1) 


td(GPIOHIH) 


H1 high to peripheral pin 
switching from input to out- 
put delay 


19 


15 


13 


10 


ns 



Figure 13-34. Timing for Change of Peripheral Pin From General-Purpose Output to 
Input Mode 
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Figure 13-35. Timing for Change of Peripheral Pin From General-Purpose Input to 
Output Mode 
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1 3.5.1 6 Timer Pin Timing 

Valid logic-level periods and polarity are specified by the contents of the inter- 
nal control registers. 

Table 13-32 and Table 13-33 define the timing parameters for the timer pin. 
The numbers shown in parentheses in Figure 1 3-36 correspond with those in 
the No. column of Table 13-32 and Table 13-33. 



Table 13-32. Timing Parameters for Timer Pin 











'oso-wcsi^y 


'C30-33/'C31-33 




No. 


Name 


Description* 


Min Max 


Min 


Max 


Unit 


0) 


tsufTCLKHIL 


TCLK ext 
setup before 
H1 low 


TCLK 
ext 


15 






ns 


(2) 


tyi(TCLKH1L 


TCLK ext 
hold after 
H1 low 


TCLK 
ext 


0 


0 




ns 


(3) 


tdfTCLKHIH) 


H1 high to 
TCLK int 
valid delay 


TCLK 
int 


13 




10 


ns 


(4) 


tcfTCLK) 


TCLK cycle 
time 


TCLK 
ext 


tc (H )x2.6t 


tc (H )x2.6t 




ns 








TCLK 
int 


tc(H)><2 tc (H )x232t 


tc(H)><2 


*c(H)*232t 


ns 


(5) 


twfTCLK) 


TCLK high/ 
low pulse 
duration 


TCLK 
ext 


tc(H)+12t 


tc(H)+12t 




ns 








TCLK 
int 


PcCtcljV 2 H5 [tcfrcuV 2 ^ 5 


Pc(TCLK)/2H5 


fc(TCLK)/2]+5 


ns 



t Guaranteed by design but not tested 

* Timing parameters 1 and 2 are applicable for a synchronous input clock. Timing parameters 4 and 5 are applicable for an 
asynchronous input clock. 
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Timing 



Table 13-33. Timing Parameters for Timer Pin 













'C31-50 




No. 


Name 


Description* 




Min 


Max 


Min 


Max 


Unit 


0) 


tsu(TCLKH1 L) 


TCLK ext set- 
up before H1 
low 


TCLK 
ext 


10 




8 




ns 


(2) 


th(TCLKH1L) 


TCLK ext hold 
after H1 low 


TCLK 
ext 


0 




0 




ns 


(3) 




H1 high to 
TCLK Int valid 
delay 


TCLK 
int 




9 




9 


ns 


(4) 


tc(TCLK) 


TCLK cycle 
time 


TCLK 
ext 


tc (H )x2.6t 




tc (H )x2.6t 




ns 








TCLK 
int 


tc(H)><2 


tc(H)x2 32t 


tc(H)* 2 


^(H)^ 


ns 


(5) 


twfTCLK) 


TCLK high/ 
low pulse du- 
ration 


TCLK 
ext 






tc(H)+10t 




ns 








TCLK 
int 


flc(TCLK)/2]-5 


[tc(TCLK)/2]+5 


Pc(TCLK)/2]^ 


PcCTCL^/2]+5 


ns 



t Guaranteed by design but not tested 

* Timing parameters 1 and 2 are applicable for a synchronous input clock. Timing parameters 4 and 5 are applicable for an 
asynchronous input clock. 



Figure 13-36. Timing for Timer Pin 
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Instruction Opcodes 



The opcode fields for all TMS320C3x instructions are shown in Table A-1 . Bits 
in the table marked with a hyphen are defined in the individual instruction de- 
scriptions (see Chapter 1 0) . Table A-1 , along with the instruction descriptions, 
fully defines the instruction words. The opcodes are listed in numerical order. 
Note that an undefined operation may occur if an illegal opcode is executed. 
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Table A-1. TMS320C3x Instruction Opcodes 



INSTRUCTION 
■raw i nvjw i iv/ii 


O 1 


10 


90 


98 


97 

£.1 


98 


98 


94 


93 


ABSF 




ft 

Vl 


ft 

V 


ft 


ft 


ft 
V/ 


ft 

V 


ft 
V/ 


ft 
w 


ABSI 


o 


ft 


ft 

V/ 


ft 

V 


ft 


o 


o 


o 


1 


ADDC 


o 


o 


0 


o 


o 


o 


o 


1 


o 


ADDF 


o 


o 


0 


o 


o 


o 


o 


1 


1 


ADDI 


0 


ft 

V/ 


ft 
u 


ft 

V/ 


0 


ft 

w 


-j 


ft 


ft 

w 


AND 


n 

VI 


ft 

V/ 


ft 

V 


0 

V» 


ft 


ft 

v# 


1 

1 


ft 

V/ 


1 

1 




n 

V 


ft 


ft 


ft 
vi 


ft 

VI 


ft 

VI 


1 

1 


1 

1 


ft 

VI 


AQM 
nOn 


u 


ft 
u 


ft 

u 


ft 


ft 
u 


ft 






i 

i 


pypc 

v/IVIr r 


u 


ft 

u 


ft 
u 


ft 
u 


ft 

u 




ft 
u 


ft 

VI 


ft 
u 


Vvlvlr 1 


u 


ft 

vl 


ft 
u 


ft 
vi 


ft 

VI 




ft 

VI 


ft 

VI 


1 


FIY 
riA 


n 

u 


ft 
vi 


ft 

Vl 


ft 
vi 


ft 

VI 




ft 

VI 


1 

1 


ft 

VJ 


FL OAT 


n 


0 
\j 


ft 


ft 

VI 


0 




ft 


1 

1 


1 

I 


IDLE 




0 


ft 


0 


ft 

w 




1 


0 


o 


IDLE2 


o 


o 


o 


o 


o 


1 


1 


o 


o 


LDE 


0 


o 


o 


o 


o 


1 


1 


o 


1 


LDF 


o 


0 


ft 

V 


0 


ft 


1 


1 


1 


ft 

w 


1 HFI 


ft 
vi 


ft 
vi 


vi 


ft 

VI 


VI 




1 


i 


1 

I 




U 


U 


ft 
U 


ft 

U 


•4 
1 


u 


U 


ft 
u 


U 


1 l"MI 

Lull 


0 


0 


0 


0 


1 


0 


0 


0 


1 


LDM 


0 


0 


0 


0 


1 


0 


0 


1 


0 


LDP 


0 


0 


0 


0 


1 


0 


0 


0 


0 


1 OI_l 

Lon 


0 


0 


0 


0 


1 


0 


0 


1 


4 

1 


1 HPOWFP 
LUrUWCn 


ft 

u 


ft 

u 


ft 

u 


4 
1 


ft 

u 


ft 

u 


ft 

u 


ft 
vl 


1 


MAXSPEED 


ft 


0 


0 


1 


ft 

V/ 


0 


0 


ft 
V 


1 


MPYF 


0 


0 


0 


0 


1 


0 


1 


0 


0 


MPYI 


0 


0 


0 


0 


1 


0 


1 


0 


1 


NEGB 


0 


0 


0 


0 


1 


0 


1 


1 


0 


NEGF 


0 


0 


0 


0 


1 


0 


1 


1 


1 


NEGI 


0 


0 


0 


0 


1 


1 


0 


0 


0 
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Table A-1. TMS320C3x Instruction Opcodes (Continued) 



INSTRUCTION 


31 


30 


29 


28 


27 


26 


25 


24 


23 


NOP 


0 


0 


0 


0 


1 


1 


0 


0 


1 


NORM 


0 


0 


0 


0 


1 


1 


0 


1 


0 


NOT 


0 


0 


0 


0 


1 


1 


0 


1 


1 


POP 


0 


0 


0 


0 


1 


1 


1 


0 


0 


POPF 


0 


0 


0 


0 


1 


1 


1 


0 


1 


PUSH 


0 


0 


0 


0 


1 


1 


1 


1 


0 


PUSHF 


0 


0 


0 


0 


1 


1 


1 


1 


1 


OR 


0 


0 


0 


1 


0 


0 


0 


0 


0 


RND 


0 


0 


0 


1 


0 


0 


0 


1 


0 


ROL 


0 


0 


0 


1 


0 


0 


0 


1 


1 


ROLC 


0 


0 


0 


1 


0 


0 


1 


0 


0 


ROR 


0 


0 


0 


1 


0 


0 


1 


0 


1 


RORC 


0 


0 


0 


1 


0 


0 


1 


1 


0 


RPTS 


0 


0 


0 


1 


0 


0 


1 


1 


1 


STF 


0 


0 


0 


1 


0 


1 


0 


0 


0 


phi 
STFI 


0 


0 


0 


1 


0 


1 


0 


0 


1 


STI 


0 


0 


0 


1 


0 


1 


0 


1 


0 


STII 


0 


0 


0 


1 


0 


1 


0 


1 


1 


SIGI 


0 


0 


0 


1 


0 


1 


1 


0 


0 


ei idq 
oUdd 


0 


0 


0 


1 


0 


1 


4 

1 


0 


4 

1 


OUDv 


u 


A 

u 


A 

u 




A 

u 




4 

I 


4 

1 


A 

u 


SUBF 




n 

V 


o 

w 








1 

1 


1 

1 


1 


SUBI 


o 


o 


o 






o 


o 


o 


o 


SUBRB 


o 


o 


o 


1 


1 


o 


o 


o 


1 


SUBRF 


o 


0 


o 


1 


1 


o 


o 


1 


o 


SUBRI 


o 


o 


0 


1 


1 


o 


o 


1 


1 


IOID 


A 
u 


n 
u 


u 






A 

u 


1 


A 

u 


A 

u 


AUn 


A 

u 


A 


A 

u 


1 


1 


A 

0 


4 

1 


A 


< 

1 


IACK 


0 


0 


0 






0 


1 


1 


0 


ADDC3 


0 


0 


1 


0 


0 


0 


0 


0 


0 


Annuo 
AUUro 


A 

u 


A 

u 




A 


0 


A 

u 


0 


A 


4 

1 


ADDI3 


0 


0 




0 


0 


0 


0 


1 


0 


AND3 


0 


0 




0 


0 


0 


0 


1 


1 


ANDN3 


0 


0 




0 


0 


0 


1 


0 


0 


ASH3 


0 


0 




0 


0 


0 


1 


0 


1 


CMPF3 


0 


0 




0 


0 


0 


1 


1 


0 


CMPI3 


0 


0 




0 


0 


0 


1 


1 


1 
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Instruction Opcodes 



Table A-1 .TMS320C3x Instruction Opcodes (Continued) 



INSTRUCTION 


31 


30 


29 


28 


27 


26 


25 


24 


23 


LSH3 


0 


0 


'1 


0 


0 


! 


0 


0 


0 


MPYF3 


0 


0 


! 


0 


0 


! 


0 


0 


1 


MPYI3 


0 


0 


! 


0 


0 




0 


1 


0 


OR3 


0 


0 


i 


0 


0 


! 


0 


1 


1 


SUBB3 


0 


0 


1 


0 


0 




1 


0 


0 


SUBF3 


0 


0 


1 


0 


0 


1 


1 


0 


1 


SUB13 


0 


0 


1 


0 


0 


1 


1 


1 


0 


TSTB3 


0 


0 


! 


0 


0 


! 


1 


1 


1 


XOR3 


0 


0 




0 


1 


0 


0 


0 


0 


LDFcond 


0 


1 


0 


0 












LDlcond 


0 


1 


0 


1 












BR(D)t 


0 




1 


0 


0 


0 


0 


— 


— 


CALL 


0 


i 




0 


0 


0 


1 








RPTB 


0 




1 


0 


0 


1 


0 








SWI 


0 




! 


0 


0 


1 


1 







Bconc3f(D)t 


0 


1 


1 


0 


1 


0 


— 


— 


— 


DBcond(D)t 


0 






0 


1 


1 






— 


CALLcond 


0 


! 


1 


1 


0 


0 








TRAP cond 


0 


1 


1 


1 


0 


1 


0 


_ 




RETlcond 


0 


i 


1 


1 


1 


0 


0 


0 


0 


RETScond 


0 


1 


1 


1 


1 


0 


0 


0 


1 


MPYF3||ADDF3 




0 


0 


o 


0 


0 


0 


0 






1 


0 


0 


0 


0 


0 


0 


1 






1 


0 


0 


0 


0 


0 


1 


0 






1 


0 


0 


0 


0 


0 


1 


1 


... 


MPYF3||SUBF3 


1 


0 


0 


0 


o 


1 


0 


0 






1 


0 


0 


0 


0 


1 


0 


1 








0 


0 


0 


0 


1 


1 


0 








0 


0 


0 


0 


1 


1 


1 




MPYI3||ADDI3 




0 


0 


0 


1 


0 


0 


0 








0 


0 


0 


1 


0 


0 


1 








0 


0 


0 


1 


0 


1 


0 








0 


0 


0 


1 


0 


1 


1 





t Opcode same for standard and delayed instructions. 
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Table A-1.TMS320C3x Instruction Opcodes (Concluded) 



INSTRUCTION 31 


30 


29 


28 


27 


26 


25 


24 


23 


MPYI3||SUBI3 1 


0 


0 


0 


1 


1 


0 


0 


- 


1 


0 


0 


0 


1 


1 


0 


1 


- 




0 


0 


0 


1 


1 


1 


0 


- 


1 


0 


0 


0 


1 


1 


1 


1 


— 


STF||STF 1 


1 


0 


0 


0 


0 


0 


— 




STI||STI 1 


1 


0 


0 


0 


0 


1 


— 


— 


LDF||LDF 1 


1 


0 


0 


0 


1 


0 


— 


— 


LDI||LDI 1 


1 


0 


0 


0 


1 


1 


— 


— 


ABSF||STF 1 


1 


0 


0 


1 


0 


0 


— 


— 


ABSI||STI 1 


1 


0 


0 


1 


0 


1 


— 


— 


ADDF3||STF 1 


1 


0 


0 


1 


1 


0 


— 


- 


ADDI3||STI 1 


1 


0 


0 


1 


1 


1 


- 


- 


AND3||STI 1 


1 


0 


1 


0 


0 


0 


— 


- 


ASH3||STI 1 


1 


0 


1 


0 


0 


1 


— 


- 


FIX||STI 1 


1 


0 


1 


0 


1 


0 


— 


— 


FLOAT||STF 1 


1 


0 


1 


0 


1 


1 


— 


— 


LDF||STF 1 


1 


0 


1 


1 


0 


0 


— 


— 


LDI||STI 1 


1 


0 


1 


1 


0 


1 


— 


— 


LSH3||STI 1 


1 


0 


1 


1 


1 


0 


— 


— 


MPYF3||STF 1 


1 


0 


1 


1 


1 


1 


— 


— 


MPYI3||STI 1 


1 


1 


0 


0 


0 


0 


— 


— 


NEGF||STF 1 


1 


1 


0 


0 


0 


1 


- 


— 


NEGI||STI 1 


1 


1 


0 


0 


1 


0 


- 


- 


NOT||STI 1 


1 


1 


0 


0 


1 


1 






OR3||STI 1 






0 




0 


0 






SUBF3||STF 1 






0 




0 


1 






SUBI3||STI 1 






0 




1 


0 






XOR3||STI 1 






0 




1 


1 






Reserved for reset, 0 






1 




1 


1 


1 


1 


traps, and interrupts 
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Development Support/Part Ordering Information 



This appendix provides development support information, device part num- 
bers, and support tool ordering information for the TMS320C3x generation. 

Each TMS320C3x support product is described in the TMS320 Family Devel- 
opment Support Reference Guide (literature number SPRU011).ln addition, 
more than 100 third-party developers offer products that support the Tl 
TMS320 family. For more information, refer to the TMS320 Third-Party Refer- 
ence Guide (literature number SPRU052). 

For information on pricing and availability, contact the nearest Tl field sales of- 
fice or authorized distributor. 

This appendix discusses the following major topics: 

Topic Page 

B.1 TMS320C3x Development Support Tools ........................ B-2 

B.2 TMS320C3x Part Ordering Information B-7 



TMS320C3x Development Support Tools 



B.1 TMS320C3x Development Support Tools 

Texas Instruments offers an extensive line of development tools for the 
TMS320C3x generation of DSPs, including tools to evaluate the performance 
of the processors, generate code, develop algorithm implementations, and ful- 
ly integrate and debug software and hardware modules. 

The following products support development of 'C3x applications: 

Code Generation Tools 

□ Optimizing ANSI C compiler. Translates ANSI C language directly into 
highly optimized assembly code. You can then assemble and link this code 
with the Tl assembler/linker, which is shipped with the compiler. It supports 

Krv+K 'POv onrl , C*A\/ *aoe»£srv»Klw r*r\rlrs Thlo r\rr\ri i lo <m irr^n+lw r»v /r» ! I oklo 4r\r 

PC (DOS, DOS extended memory, and OS/2), VAX/VMS, and SPARC 
workstations. Refer to the TMS320 Floating-Point DSP Optimizing C 
Compiler User's Guide (SPRU034) for detailed information. 

□ Assembler/linker. Converts source mnemonics to executable object code. 
It supports both 'C3x and 'C4x assembly code. This product is currently 
available for PC (DOS, DOS extended memory, and OS/2). The 'C3x/'C4x 
assembler for the VAX/VMS and SPARC workstations is only available as 
part of the optimizing 'C3x/'C4x compiler. Refer to the TMS320 Floating- 
Point DSP Assembly Language Tools User's Guide (SPRU035) for de- 
tailed information. 

System Integration and Debug Tools 

□ Simulator. Simulates via software the operation of the 'C3x and can be 
used in C and assembly software development. This product is currently 
available for PC (DOS and Windows) and SPARC workstations. Refer to 
the TMS320C3x C Source Debugger User's Guide (SPRU054) for de- 
tailed information. 

□ XDS510 emulator. Performs full-speed in-circuit emulation with the 'C3x, 
providing access to all registers as well as to internal and external memory. 
It can be used in C and assembly software development and has the capa- 
bility of debugging multiple processors. This product is currently available 
for PC (DOS, Windows, and OS/2) and SPARC workstations. This product 
includes the emulator board (emulator box, power supply, and SCSI con- 
nector cables in the SPARC version), the 'C3x C source debugger soft- 
ware, and the JTAG cable. 
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TMS320C3x Development Support Tools 



Because 'C3x and 'C5x XDS51 0 emulators also come with the same emu- 
lator board (or box) , you can buy the 'C3x C source debugger software as a 
separate product called 'C3x C Source Debugger Conversion Software. 
This enables you to debug , C3x/'C4x/'C5x applications with the same 
emulator board. The emulator cable that comes with the 'C5x XDS510 
emulator is not compatible with the 'C3x. You need a JTAG emulation con- 
version cable. Refer to the TMS320C3x C Source Debugger User's Guide 
(SPRU053) for detailed information on the 'C3x emulator. 

□ Evaluation module (EVM). Each EVM comes complete with a PC halfcard 
and software package. The EVM board contains the following: 

■ A TMS320C30 and a 33-MFLOPS, 32-bit floating-point DSP 

■ A 1 6K-word, zero-state SRAM, allowing coding of most algorithms di- 
rectly on the board 

■ A speaker/microphone-ready analog interface for multimedia, 
speech, and audio applications development 

■ A multiprocessor serial port interface for connecting to multiple EVMs 

■ A host port for PC communications 

The system also comes with all the software required to begin applications 
development on a PC host. Equipped with a C and assembly language 
source level debugger for the DSP, the EVM has a window-oriented, 
mouse-driven interface that enables the downloading, executing, and de- 
bugging of assembly code or C code. 

The TMS320C3x assembler/linker is also included with the EVM. For us- 
ers who prefer programming in a high-level language, an optimizing ANSI 
C compiler and Ada compiler are offered separately. 
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□ Emulation porting kit (EPK). Enables you to integrate emulation technolo- 
gy directly into your system without the need of an XDS510 board. This 
product is intended to be used by third parties and high-volume board 
manufacturers and requires a licensing agreement with Texas Instru- 
ments. The kit contains host (or PC) source and object code, which lets 
you tailor 'C30 EVM-like capabilities to your TMS320C3x system via the 
SM74ACT8990 test bus controller (TBC). The EPK can be used in such 
applications as program download for system self-test and initialization or 
system emulation and debug to feature resident emulation support. EPK 
software includes the Tl high-level language (HLL) debugger in object as 
well as source code for the TBC communication interface. The HLL code 
is the windowed debugger found with many Tl DSP simulators, evaluation 
modules (EVMs), and emulators. With the EPK, the HLL user interface 
can be ported directly to the system board. The source code for the TBC 
communication interface consists of such commands as read/write, 
memory run, stop, and reset that communicate with the TMS320C3x de- 
vice. Using the EPK reduces system and development cost and speeds 
time to market. For more information on the kit, call the DSP hotline at 
(713) 274-2320. 

B.1 .1 TMS320 Third Parties 

The TMS320 family is supported by product and service offerings from more 
than 100 independent vendors and consultants, known as third parties. These 
support products take various forms (both software and hardware) from cross- 
assemblers, simulators, and DSP utility packages to logic analyzers and emu- 
lators. Additionally, Tl third parties offer more than 150 algorithms that are 
available for license through the TMS320 software cooperative. These algo- 
rithms can greatly reduce development time and decrease time to market. The 
expertise of those involved in support services ranges from speech encoding 
and vector quantization to software/hardware design and system analysis. 

For a more detailed description of services and products offered by third par- 
ties, refer to the TMS320 Third Party Support Reference Guide (literature 
number SPRU052) and the TMS320 Software Cooperative Data Sheet Pack- 
et (literature number SPRT1 1 1 ). Call the Literature Response Center at (800) 
477-8924 to request a copy. 
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B.1.2 TMS320 Literature 

Extensive DSP documentation is available; this includes data sheets, user's 
guides, and application reports. In addition, DSP textbooks that aid research 
and education have been published by Prentice-Hall, John Wiley and Sons, 
and Computer Science Press. To order literature or to subscribe to the DSP 
newsletter Details on Signal Processing (for up-to-date information on new 
products and services), call the Literature Response Center at (800) 
477-8924. 

B.1.3 DSP Hotline 

For answers to TMS320 technical questions on device problems, develop- 
ment tools, documentation, upgrades, and new products, you can contact the 
DSP hotline via: 

□ Phone at (713)274-2320 Monday through Friday from 8:30 a.m. to 5:00 
p.m. central time 

□ Fax at (713)274-2324 

□ Electronic mail at 4389750@mcimail.com. 

□ European fax at 33-1 -3070-1 032 

□ Semiconductor Product Information Center (PIC) at (214) 644-5580 

To ask about third-party applications and algorithm development packages, 
contact the third party directly. Refer to the TMS320 Third-Party Support Ref- 
erence Guide (literature number SPRU052) for addresses and phone 
numbers. 

Extensive DSP documentation is available; this includes data sheets, user's 
guides, and application reports. Call the hotline at (800) 477-8924 for informa- 
tion on literature that you can request from the Literature Response Center. 

The DSP hotline does not provide pricing information. Contact the nearest Tl 
field sales office or the Tl PIC for prices and availability of TMS320 devices and 
support tools. 

B.1.4 Bulletin Board Service (BBS) 

The TMS320 DSP Bulletin Board Service (BBS) is a telephone-line computer 
service that provides information on TMS320 devices, specification updates 
for current or new devices and development tools, silicon and development 
tool revisions and enhancements, new DSP application software as it be- 
comes available, and source code for programs from any TMS320 user's 
guide. 
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You can access the BBS via the following: 

□ Modem: (300-, 1 200-, or 2400-bps) dial (71 3)274-2323. Set your modem 
to 8 data bits,1 stop bit, no parity. 

□ Internet: Use anonymous ftpto ti.com (Internet port address 192.94.94.1). 
The BBS content is located in the subdirectory called mirrors. 

To find out more about the BBS, refer to the TMS320 Family Development 
Support Reference Guide (literature number SPRU011). 

B.1 .5 Technical Training Organization (TTO) TMS320 Workshop 

The TMS320C3x DSP design workshop is tailored for hardware and software 
design engineers and decision-makers who will be designing and utilizing the 
TMS320C3x generation of DSP devices. Hands-on exercises throughout the 
course give participants a rapid start in utilizing TMS320C3x design skills. Mi- 
croprocessor/assembly language experience is required. Experience with dig- 
ital design techniques and C language programming experience is desirable. 
The following topics are covered in the TMS320C3x workshop: 

□ TMS320C3x architecture/instruction set 

□ Use of the PC-based TMS320C3x software simulator and EVM 

□ Floating-point and parallel operations 

□ Use of the TMS320C3x assembler/linker 

□ C programming environment 

□ System architecture considerations 

□ Memory and I/O interfacing 

□ TMS320C3x development support 

For registration, pricing, or enrollment information on this and other TTO 
TMS320 workshops, call (800) 336-5236, ext. 3904. 
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B.2 TMS320C3x Part Ordering Information 

This section provides the device and support tool part numbers. Table B-1 
lists the part numbers for the TMS320C30 and TMS320C31 ; Table B-2 gives 
ordering information for TMS320C3x hardware and software support tools. An 
explanation of the TMS320 family device and development support tool prefix 
and suffix designators follows the two tables to assist in understanding the 
TMS320 product numbering system. 



Table B-1. TMS320C3x Digital Signal Processor Part Numbers 



Device 


Technology 


Operating 
Frequency 


Package Type 


Typical Power 
Dissipation 


TMS320C30GEL 


0.8-|im CMOS 


33 MHz 


Ceramic 181 -pin PGA 


1.00 w 


TMS320C30GEL27 


0.8-jxm CMOS 


27 MHz 


Ceramic 181 -pin PGA 


0.875 W 


TMS320C30GEL40 


0.8-^im CMOS 


40 MHz 


Ceramic 181 -pin PGA 


1.25 W 


TMS320C30PPM40 


0.8-^im CMOS 


40 MHz 


Plastic 208-pin QFP 


0.85 W 


TMS320C31 PQL/PQA 


0.8-(xm CMOS 


33 MHz 


Plastic 132-pin QFP 


0.75 W 




u.o-pim umuo 


97 MH-7 




U.oU VV 


TMS320C31PQL40 


0.8-|im CMOS 


40 MHz 


Plastic 132-pin QFP 


0.90 W 


TMS320LC31 PQL 


0.8-|im CMOS 


33 MHz 


Plastic 132-pin QFP 


0.50 W 


TMS320C31PQL50 


0.8-jxm CMOS 


50 MHz 


Plastic 132-pin QFP 


1.00 W 


SMJ320C316FA27 
SMJ320C31HF627 
SMJ320C316FA33 
SMJ320C316HF633 


0.8-jxm CMOS 


28 MHz 


Ceramic 141 -pin PGA 
Ceramic 132-pin QFP 
Ceramic 141 -pin PGA 
Ceramic 132-pin PGA 


0.60 W 
0.60 W 
0.75 W 
0.75 W 


SMJ320C306BM33 
SMJ320C30HF633 


0.8-pim CMOS 


33 MHz 


Ceramic 181 -pin PGA 
Ceramic 196-pin QFP 


1.10W 


SMJ320C30GBM28 

SMJ320C30HF628 

SMJ320C30HTM28 


0.8-jim CMOS 


28 MHz 


Ceramic 181 -pin PGA 
Ceramic 196-pin QFP 


1.00 W 
1.00 W 


SMJ320C30GBM25 

SMJ320C30HF625 

SMJ320C30HTM25 


0.8-jxm CMOS 


25 MHz 


Ceramic 181 -pin PGA 
Ceramic 196-pin QFP 


1.00 W 
1.00W 
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Table B-2. TMS320C3x Support Tool Part Numbers 


Tool Description 


Operating System 


Part Number 


(a) Software 


C Compiler & Macro Assembler/ Linker 


VAX/VMS 


TMDS3243255-08 




PC-DOS/MS-DOS 


TMDS3243855-02 




SPARC (Sun OS)t 


TMDS3243555-08 


Assembler/Linker 


PC-DOS/MS-DOS; OS/2 


TMDS3243850-02 


Simulator 


VAX VMS 


TMDS3243251-08 




PC-DOS/MS-DOS 


TMDS3243851-02 




SPARC (SUN OS) T 


TMDS324355 1 -09 


Tartan Floating-Point Library 


PC-DOS 


320 FLO-PC C30 




SPARC (Sun OS) 


320 FLO-Sun-C30 


Digital Filter Design Package 


PC-DOS 


DFDP 


Tartan C++ Compiler/Debugger 


PC-DOS; OS/2, Wiredown 


TAR-CCM-PC-C3X 




SPARC (Sun OS) 


TAR-CCM-SP-C3X 


Tartan C++ Compiler 


PC-DOS; OS/2, Wiredown 


TAR-SI M-PC-C3x 




SPARC (Sun OS) 


TAR-SI M-SP-C3X 


TMS320C3x Emulation Porting Kit 




TMSX3240030 


(b) Hardware 


XDS510 Emulator 


PC/MS-DOS 


TMDS3260131 


Evaluation Module (EVM) 


PC-DOS/MS-DOS 


TMDX3260030 



t Note that SUN UNIX supports TMS320C3X software tools on the 68000 family-based SUN-3 series workstations and on the 
SUN-4 series machines that use the SPARC processor, but not on the SUN-386I series of workstations. 



B.2.1 Device and Development Support Tool Prefix Designators 

Prefixes to Tl part numbers designate phases in the product's development 
stage for both devices and support tools, as shown in the following definitions: 

Device Development Evolutionary Flow 

□ TMX: Experimental device that is not necessarily representative of the fi- 
nal device's electrical specifications 

□ TMP: Final silicon die that conforms to the device's electrical specifica- 
tions but has not completed quality and reliability verification 

□ TMS: Fully qualified production device 

Support Tool Development Evolutionary Flow 

□ TMDX: Development support product that has not yet completed Tl's in- 
ternal qualification testing for development systems 

□ TMDS: Fully qualified development support product 
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TMX and TM P devices and TM DX development support tools are shipped with 
the following disclaimer: 

"Developmental product is intended for internal evaluation purposes." 



Note: Prototype Devices 

Tl recommends that prototype devices (TMX or TMP) not be used in produc- 
tion systems because their expected end-use failure rate is undefined but 
predicted to be greater than standard qualified production devices. 



TMS devices and TM DS development support tools have been fully character- 
ized, and their quality and reliability have been fully demonstrated. Tl's stan- 
dard warranty applies to TMS devices and TMDS development support tools. 

TMDX development support products are intended for internal evaluation pur- 
poses only. They are covered by Tl's Warranty and Update Policy for Micropro- 
cessor Development Systems products; however, they should be used by cus- 
tomers only with the understanding that they are developmental in nature. 



B.2.2 Device Suffixes 

The suffix indicates the package type (for example, N, FN, or GE) and temper- 
ature range (for example, L). 

Figure B-1 presents a legend for reading the complete device name for any 
TMS320 family member. 
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Figure B~1. TMS320 Device Nomenclature 



TMS 320 



30 GE 



Prefix 

TMX= Experimental Device 
TMP« Prototype Device 
TMS* Qualified Device 
SMJ * MIL-STD-883C 



Device Family 

320 -TMS320 Family 



Technology 

C = CMOS 
E x CMOS EPROM 
P « OTPEPROM 
No Letter - NMOS 



Device 

let-generation DSP: 
10 
14 
15 
16 
17 

2nd-generation DSP: 
20 
25 
26 

3rd-generation DSP: 
30 
31 

4th-generation DSP: 
40 

5th-generation DSP: 
50 
51 



L 



Temperature Range 

H « 0to50°C 
L « 0to70°C 
S « -55to100°C 
Mx -55to125°C 
At. -40 to 85°C 



- Package Type 


N 


as 


Plastic DIP 


JD 




Ceramic DIP Side-Brazed 


FN 


3 


Plastic Leaded CC 


6B 


as 


Ceramic PGA 


FJ 


as 


Ceramic Leaded CC 


FD 




Leadless Ceramic CC 


rc 




Ceramic Leaded CC 


GE 


s 


Ceramic PGA, Glass Seal 


HU 




Ceramic Quad Flatpack 


HT 


m 


Ceramic Quad Flatpack 






(gull wing) 


PQ 


X 


Plastic Quad Flatpack 



t See electrical specifications for TMS320C31 PQA case temperature ratings. 
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Quality and Reliability 



The quality and reliability of Texas Instruments (Tl) microprocessor and 
microcontroller products, which include TMS320 digital signal processors, re- 
lies on feedback from the following: 

□ Our customers, 

□ Our total manufacturing operation from front-end wafer fabrication to final 
shipping inspection, and 

□ Product quality and reliability monitoring. 

Our customer's perception of quality is the governing criterion forjudging per- 
formance. This concept is the basis for Tl Corporate Quality Policy, which is 
as follows: 

"For every product or service we offer, we shall define the requirements that 
solve the customer's problems, and we shall conform to those requirements 
without exception." 

Texas Instruments has developed a leadership reliability qualification system, 
based on years of experience with leading-edge memory technology and on 
years of research into customer requirements. To achieve constant improve- 
ment, programs that support that system respond to customer input and inter- 
nal information. 

This appendix presents the following major topics: 

Topic Page 

C.1 Reliability Stress Tests C-2 

C.2 TMS320C31 PQFP Ref low Soldering Precautions ................ C-7 



Reliability Stress Tests 



C.1 Reliability Stress Tests 

Accelerated stress tests are performed on new semiconductor products and 
process changes to qualify them and ensure excellence in product reliability. 
The following test environments are typical: 

□ High-temperature operating life 

□ Storage life 

□ Temperature cycling 

□ Biased humidity 

□ Autoclave 

□ Electrostatic discharge 

□ Package integrity 

□ Electromigration 

□ Channel-hot electrons (performed on geometries less than 2.0 jim) 

Typical events or changes that require internal requalification of a product in- 
clude the following: 

□ New die design, shrink, or layout 

□ Wafer process (baseline/control systems, flow, mask, chemicals, gases, 
dopants, passivation, or metal systems) 

□ Packaging assembly (baseline control systems or critical assembly equip- 
ment) 

□ Piece parts (such as lead frame, mold compound, mount material, bond 
wire, or lead finish) 

□ Manufacturing site 

Tl reliability control systems extend beyond qualification. Total reliability con- 
trols and management include product reliability monitoring as well as final 
product release controls. MOS memories, utilizing high-density active ele- 
ments, serve as the leading indicator in wafer-process integrity at Tl MOS fab- 
rication sites, enhancing all MOS logic device yields and reliability. Tl places 
more than several thousand MOS devices per month on reliability tests to en- 
sure and sustain built-in product excellence. 

Table C-1 lists the microprocessor and microcontroller reliability tests, the du- 
ration of the test, and sample size. Table C-2 contains definitions and descrip- 
tions of terms used in those tests. 
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Table C-1. Microprocessor and Microcontroller tests 



Sample Size 

Test Duration Plastic Ceramic 



Operating life, 1 25° C, 5.0 V 1 000 hrs 1 29 1 29 

Storage life, 150° C 1000 hrs 45t 45 

Biased humidity, 85° C/85 percent 1 000 hrs 77 
RH.5.0V 

Autoclave, 1 21 ° C, 1 ATM 240 hrs 45 

Temperature cycle, -65 to 1 50° C 1 000 eye* 77 77 

Temperature cycle, 0 to 1 25° C 3000 eye 77 77 

Thermal shock,-65 to 150° C 200 eye 77 77 

Electrostatic discharge, ± 2 kV 15 15 

Latch-up (CMOS devices only) 5 5 

Mechanical sequence - 22 

Thermal sequence - 22 

Thermal/mechanical sequence - 22 

PIND - 45 

Internal water vapor - 3 

Severability 22 22 

Solder heat 22 22 

Resistance to solvents 15 15 

Lead integrity 15 15 

Lead pull 22 - 

Lead finish adhesion 15 15 

Salt atmosphere 15 15 

Flammability (UL94-V0) 3 

Thermal impedance 5 5 

t If junction temperature does not exceed plasticity of package 
X For severe environments; reduced cycles for office environments 
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Table C-2. Definitions of Microprocessor Testing Terms 



Term 



Definition/Description 



References 



Average Outgoing Quality (AOQ) 



Failure in Time (FIT) 



Operating Life 



Storage Life 
Biased Humidity 



Autoclave (Pressure Cooker) 



Temperature Cycle 



Electrostatic Discharge 



Amount of defective product in a popu- 
lation, usually expressed in terms of 
parts per million (PPM). 

Estimated field failure rate In number of 
failures per billion power-on device 
hours; 1000 FITS equal 0.1 percent fail- 
ure per 1000 device hours. 

Device dynamically exercised at a high 
ambient temperature (usually 125° C) to 
simulate field usage that would expose 
the device to a much lower ambient 
temperature (such as 55° C). Using a 
derived high temperature, a 55°C ambi- 
ent failure rate can be calculated. 

Device exposed to 150° C unbiased 
condition. Bond integrity is stressed in 
this environment. 

Moisture and bias used to accelerate 
corrosion-type failures in plastic pack- 
ages. Conditions include 85° C ambient 
temperature with 85% relative humidity 
(RH). Typical bias voltage is +5V and is 
grounded on alternating pins. 

Plastic-packaged devices exposed to 
moisture at 121° C using a pressure of 
one atmosphere above normal pres- 
sure. The pressure forces moisture per- 
meation of the package and acceler- 
ates corrosion mechanisms (if present) 
on the device. External package con- 
taminants can also be activated and 
caused to generate inter-pin current 
leakage paths. 

Device exposed to severe temperature 
extremes in an alternating fashion (-65° 
C for 1 5 minutes and 1 50° C for 1 5 min- 
utes per cycle) for at least 1000 cycles. 
Package strength, bond quality, and 
consistency of assembly process are 
tested in this environment. 

Device exposed to electrostatic 
discharge pulses. Calibration is accord- 
ing to MIL STD 883C, method 3015.6. 
Devices are stressed to determine fail- 
ure threshold of the design. 
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Table C-2. Definitions of Microprocessor Testing Terms (Continued) 



Term 



Definition/Description 



References 



Thermal Shock 



Particle Impact Noise Detection 
(PIND) 

Mechanical Sequence 



Thermal Sequence 



Thermal/Mechanical Sequence 



Test similar to the temperature cycle 
test, but involving a liquid-to-liquid 
transfer. 

A nondestructive test to detect loose 
particles inside a device cavity. 

Fine and gross leak 
Mechanical shock 

PIND (optional) 

Vibration, variable frequency 

Constant acceleration 
Fine and gross leak 
Electrical test 

Fine and gross leak 
Solder heat (optional) 
Temperature cycle 
(10 cycles minimum) 
Thermal shock 
(10 cycles minimum) 
Moisture resistance 
Fine and gross leak 
Electrical test 

Fine and gross leak 
Temperature cycle 
(10 cycles minimum) 
Constant acceleration 

Fine and gross leak 
Electrical test 
Electrostatic discharge 
Severability 
Solder heat 

Salt atmosphere 

Lead pull 
Lead integrity 



Electromigration 



Resistance to solvents 



MIL-STD-883C, Method 1011 



MIL-STD-883C, Method 1014 
MIL-STD-883C, Method 2002, 
1500 g, 0.5 ms, Condition B 
MIL-STD-883C, Method 2020 
MIL-STD-883C, Method 2007, 
20g, Condition A 
MIL-STD-883C, Method 2001 
MIL-STD-883C, Method 1014 
To data sheet limits 

MIL-STD-883C, Method 1014 
MIL-STD-750C, Method 1014 
MIL-STD-883C, Method 1010, 
-65 to + 150 °C, Condition C 
MIL-STD-883C, Method 1011, 
-55 to +125 °C, Condition B 
MIL-STD-883C, Method 1004 
MIL-STD-883C, Method 1014 
To data sheet limits 

MIL-STD-883C, Method 1014 
MIL-STD-883C, Method 1010, 
-65 to +150 °C, Condition C 
MIL-STD-883C, Method 2001, 
30 kg, Y1 Plane 
MIL-STD-883C, Method 1014 
To data sheet limits 
MIL-STD-883C, Method 3015 
MIL-STD-883C, Method 2033 
MIL-STD-750C, Method 2031, 
1 0 sec 

MIL-STD-883C, Method 1009, 
Condition A, 24 hrs min 
MIL-STD-883C, Method 2004, 
Condition A 

MIL-STD-883C, Method 2004, 
Condition B1 

Accelerated stress testing of 
conductor patterns to ensure 
acceptable lifetime of power- 
on operation 

MIL-STD-883C, Method 2015 
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Table C-3 lists the TMS320C3x devices, the approximate number of transis- 
tors, and the equivalent gates. The numbers have been determined from de- 
sign verification runs. 

Table C-3. TMS320C3x Transistors 

Device # Transistors # Gates 

CMOS: TMS320C30 600K-700K 200K 

CMOS: TMS320C31 500K-600K 100K 



i 1 1 1 i 

Note: MOS Semiconductors 

Texas Instruments reserves the right to make changes in MOS semiconduc- 
tor test limits, procedures, or processing without notice. Unless prior ar- 
rangements for notification have been made, T! advises all customers to r©= 
verify current test and manufacturing conditions prior to relying on published 
data. 

i * 
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C.2 TMS320C31 PQFP Reflow Soldering Precautions 

Recent tests have identified an industry-wide problem experienced by sur- 
face-mounted devices exposed to reflow soldering temperatures. This prob- 
lem involves a package-cracking phenomenon sometimes experienced by 
large (for example, 132-pin) plastic quad flat pack (PQFP) packages during 
surface-mount manufacturing. This phenomenon occurs if the TMS320C31 
PQA or PQL is exposed to uncontrolled levels of humidity prior to reflow solder. 
This moisture can flash to steam during solder reflow and cause sufficient 
stress to crack the package and compromise device integrity. Once the device 
is soldered or socketed into the board, no special handling precautions are re- 
quired. 

To minimize moisture absorption, Tl ships the TMS320C31 PQA or PQL in dry 
pack shipping bags with a relative humidity (RH) indicator card and moisture- 
absorbing desiccant. These moisture-barrier shipping bags will adequately 
block moisture transmission to allow shelf storage for 1 2 months from date of 
seal when stored at less than 60% RH and less than 30° C. Devices may be 
stored outside the sealed bags indefinitely if stored at less than 25% RH and 
less than 30° C. 

Once the bag seal is broken, the devices should, within two days of removal, 
be reflow soldered and stored at less than 60% RH and less than 30° C. If these 
conditions are not met, Tl recommends baking the devices in a clean oven at 
125° C and 10% maximum RH for 25 hours. This procedure restores the de- 
vices to their dry-packed moisture level. 

i t i 
Note: ESD Precautions 

Shipping tubes will not withstand the 1 25° C baking process. Before baking, 
transfer the devices to a metal tray or tube. Follow standard ESD precau- 
tions. 



Tl recommends that the reflow process not exceed two solder cycles and that 
the temperature not exceed 220° C. 

If you have questions or concerns, please contact your local Tl representative. 
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Calculation of TMS320C30 Power Dissipation 



The TMS320C30 is a state-of-the-art, high-performance, 32-bit floating-point 
digital signal processing (DSP) microprocessor fabricated in CMOS 
technology. This device is the first member of the third generation of TMS320 
family single-chip DSP microprocessors. Since 1982, when the first-genera- 
tion TMS32010 was introduced, the TMS320 family has established itself as 
the industry standard for DSP. The TMS320C30's innovative architecture and 
specialized instruction set provide high-speed and increased flexibility for DSP 
applications. This combination makes it possible to execute up to 40 million 
floating point operations per second (MFLOPS). 

As device sophistication and levels of integration increase with evolving semi- 
conductor technologies, actual levels of power dissipation vary widely and de- 
pend heavily on the particular application in which the device is used and the 
nature of the program being executed. In addition, due to the inherent charac- 
teristics of CMOS technology, power requirements vary according to clock 
rates and data values being processed. 

This appendix presents the information necessary to determine TMS320C30 
power supply current requirements under different operating conditions. With 
this information, you can determine the device's power dissipation, which, in 
turn, you can use to calculate thermal management requirements. 



This appendix discusses the following major topics: 

Topic Page 

D.1 Fundamental Power Dissipation Characteristics D-2 

D.2 Current Requirement for Internal Circuitry ......... . . . , D-5 

D.3 Current Requirement for Output Driver Circuitry . . . . D-9 

D.4 Calculation of Total Supply Current . ...... — D-1 8 

D.5 Example Supply Current Calculations . . . . D-26 

D.6 Summary . . ...... ....... D-28 

D.7 Photo of l DD for FFT D-29 

D.8 FFT Assembly Code D-30 
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D.1 Fundamental Power Dissipation Characteristics 

Typically, an IC's (integrated circuit) power specification is expressed as a 
function of operating frequency, supply voltage, operating temperature, and 
output load. As devices become more complex, the specification must also be 
based on device functionality. CMOS devices inherently draw current only dur- 
ing switching through the linear region. Therefore, the power supply current 
is related to the rate of switching. Furthermore, since the output drivers of the 
TMS320C30 are specified to drive direct current (DC) loads, the power supply 
current resulting from external writes depends not only on switching rate but 
also on the value of data written. 

D.1 .1 Components of Power Supply Current Requirements 

There are four basic components of the power supply current: 

□ Quiescent, 

□ Internal Operations, 

□ Internal Bus Operations, and 

□ External Bus Operations 

D.1 .2 Dependencies 

The power supply current consumption depends on many factors. Four are 
system-related: 

□ Operating frequency, 

□ Supply voltage, 

□ Operating temperature, and 

□ Output load 

Several others are also related to TMS320C30 operation, including: 

□ Duty cycle of operations, 

□ Number of buses used, 

□ Wait states, 

□ Cache usage, and 

□ Data value 



D-2 



Fundamental Power Dissipation Characteristics 



The total power supply current for the device is described in this equation, 
which applies the four basic power supply current components and the depen- 
dencies described above: 

1 - ('q + 'iops + 'ibus + 'xbus) x ™ x T 
where 

Iq is the quiescent current component, 

Ijops [S ^ e current component due to internal operations, 

'ibus ,s the current component due to internal bus usage, including data value 
and cycle time dependencies, 

•xbus is the current component due to external bus usage, including data 
value, wait state, cycle time, and capacitive load dependencies, 

FV is a scale factor for frequency and supply voltage, and 

T is a scale factor for operating temperature. 

Application of this equation and determination of all of the dependencies are 
described in detail in this appendix. 

This appendix explains, in detail, how to determine the power supply current 
requirement for the TMS320C30. If a less detailed analysis is sufficient, the 
minimum, typical, and maximum values can be used to determine a rough esti- 
mate of the power supply current requirements. The minimum power supply 
current requirement is 110 mA. The typical and average current consumption 
is 200 mA, as described in the TMS320C30 data sheet, and will be associated 
with most algorithms running on the device unless data output is excessive. 



Maximum Current Requirement 

The maximum current requirement is 600 mA and occurs only 
under worst case conditions: namely, writing alternating data 
(AAAAAAAAh to 55555555h) out of both external buses 
simultaneously, every cycle, with 80 pF loads and running at 33 
MHz. 



If an extremely conservative approach is desired, the maximum value can be 
used. 
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D.1 .3 Determining Algorithm Partitioning 

Each part of an algorithm behaves differently, depending on its internal and ex- 
ternal bus usage. To analyze the power supply current requirement, you must 
partition an algorithm into segments with distinct concentrations of internal or 
external bus usage. The analysis that follows is applied to each distinct pro- 
gram segment to determine the power supply current requirement for that sec- 
tion. The average power supply current requirement can then be calculated 
from the requirements of each segment of the algorithm. 



D.1 .4 Test Setup Description 

All TMS320C30 supply current measurements were performed on the test set- 
up shown in Figure D-1 . The test setup consists of a TMS320C30, 8K words 
of zero-wait-state Cypress Semiconductor SRAMs (CY7C186— 25PC), and 
RC loads on all data and address lines. A Tektronix Current Probe (P6042) 
measures the power supply current in all Vqd lines of the device. The supply 
voltage on the output load is 2.1 5 V. Unless otherwise specified, all measure- 
ments are made at a supply voltage of 5.0 V, an input clock frequency of 33 
MHz, a capacitive load of 80 pF, and an operating temperature of 25°C. 



Figure D-1. Current Measurement Test Setup 



CY7C186-25PC 



SRAM 



2.15V 
R = 825 Q 



32 D 



+ VDD 



Tektronix 
Current Probe 
(P6042) 



24 A' 



V D D 

TMS320C30 
Primary Expansion 

vss 



2.15V 



R = 825 Q 
32 D , 



13 A ' 
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D.2 Current Requirement for Internal Circuitry 

The power supply current requirement for internal circuitry consists of three 
components: quiescent, internal operations, and internal bus operations. 
Quiescent and internal operations are constants, but the internal bus opera- 
tions component varies with the rate of internal bus usage and the data values 
being transferred. 

D.2.1 Quiescent 

Quiescent refers to the baseline supply current drawn by the TMS320C30 dur- 
ing minimal internal activity, such as executing the IDLE instruction or branch- 
ing to self. It includes the current required to fetch an instruction from on- or 
off-chip memory. The quiescent requirement for the TMS320C30 is 110 mA. 
Examples of quiescent current include: 

□ Maintaining timers and serial ports 

□ Executing the IDLE instruction 

□ TMS320C30 in HOLD mode pending external bus access 

□ TMS320C30 in reset 

□ Branching to self 

D.2.2 Internal Operations 

Internal operations are those that require more current than quiescent activity 
but do not include external bus usage or significant internal bus usage. Internal 
operations include register-to-register multiplication, ALU operations, and 
branches. They add a constant 55 mA above the quiescent so that the total 
contribution of quiescent and internal operations is 1 65 mA. Note, however, 
that internal and/or external bus operations executed via an RPTS instruction 
do not contribute an internal operations power supply current component and 
hence do not add 55 mA to quiescent current. During an instruction in RPTS, 
activity other than the instruction being repeated is suspended; therefore, 
power supply current is related only to the operation performed by the instruc- 
tion being executed. The next contributing factor to the power supply current 
requirement is internal bus operations. 
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D.2.3 Internal Bus Operations 

The internal bus operations include all operations that utilize the internal buses 
extensively, such as accessing internal RAM every cycle. No distinction is 
made between internal reads (such as instruction or operand fetches from in- 
ternal ROM or internal RAM banks) and internal writes (such as operand 
stores to internal RAM banks), because internally they are equal. Significant 
use of internal buses adds a term to the power supply current requirement that 
is data-dependent. Since switching requires more current, moving changing 
data at high rates requires higher power supply current. 

Pipeline conflicts, use of cache, fetches from external wait-state memory, and 
writes to external wait-state memory all affect the internal and external bus 
cycles of an algorithm executing on the TMS320C30. Therefore, the internal 
bus usage of the algorithm must be determined to accurately calculate power 
supply current requirements. The TMS320C30 software simulator and XDS 
emulator both provide benchmarking and timing capabilities that allow bus 
usage to be determined. 

The current resulting from internal bus usage varies roughly exponentially with 
transfer rates. Figure D-2 shows internal bus current requirements for trans- 
ferring alternating data (AAAAAAAAh to 55555555h) at several transfer rates 
(expressed as the transfer cycle time) . A transfer rate less than 1 implies multi- 
ple accesses per single H1 cycle (that is, using direct memory access (DMA), 
etc.). Transfer cycle times greater than 1 refer to single-cycle transfers with 
one or more cycles between them. The minimum transfer cycle time is one- 
third, which corresponds to three accesses in a single H1 cycle. 

The data set AAAAAAAAh to 55555555h exhibits the maximum current for 
these types of operations. Less current is required for transferring other data 
patterns, and current values can be derated accordingly as described later in 
this subsection. 

As the transfer rate decreases (that is, transfer cycle time increases), the in- 
cremental Iqd approaches 0 mA. Transfer rates corresponding to more than 
seven H1 cycles do not add any current and are considered insignificant. This 
figure represents the incremental Iqd due t0 internal bus operations and is 
added to quiescent and internal operations current values. 

For example, the maximum transfer rate corresponds to three accesses every 
cycle or one-third H1 transfer cycle time. At this rate, 85 mA is added to the 
quiescent (110 mA) and internal operation (55 mA) current values for a total 
of 250 mA. 
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IncrementalFigure D-2 shows the internal bus current requirement when tran- 
sferring As, followed by 5s, for various transfer rates. Figure D-3 shows the 
data dependence of the internal bus current requirement when the data is oth- 
er than As followed by 5s. The trapezoidal region bounds all possible data val- 
ues transferred. The lower line represents the scale factor for transferring the 
same data. The upper line represents the scale factor for transferring alternat- 
ing data (all Os to all Fs or all As to all 5s, etc.). 

Figure D-2. Internal Bus Current Versus Transfer Rate 
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Figure Internal Bus Current Versus Data Complexity Derating Curve 
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Since the possible permutations of data values is quite large, the extent to 
which data varies is referred to as relative data complexity. This term repre- 
sents a relative measure of the extent to which data values are changing and 
the extent to which the number of bits are changing state. Therefore, relative 
data complexity ranges from 0, signifying minimal variation of data, to a nor- 
malized value of 1 , signifying greatest data variation. 

If a statistical knowledge of the data exists, Figure D-3 can be used to deter- 
mine the exact power supply requirement according to internal bus usage. For 
example, Figure D-3 indicates a 63% scale factor when all Fs are moved inter- 
nally every cycle with two accesses per cycle. This scale factor is multiplied 
by 55 mA (from Figure D-2, at one-half H1 cycle transfer time), yielding 34.65 
mA because of internal bus usage. Therefore, an algorithm running under 
these conditions requires about 200 mA of power supply current (110 + 55 + 
34.65). 

Since a statistical knowledge of the data might not be readily available, a nomi- 
nal scale factor will suffice. The median between the minimum and maximum 
values at 50% relative data complexity yields a value of 0.80. This value will 
serve as an estimate of a nominal scale factor. Therefore, you can use this 
nominal data scale factor of 80% for internal bus data dependency, adding 44 
mA to 1 1 0 mA (quiescent) and 55 mA (internal operations) to yield 21 0 mA. As 
an upper bound, assume worst case conditions of three accesses of alternat- 
ing data every cycle, adding 85 mA to 1 1 0 mA (quiescent) and 55 mA (internal 
operations) to yield 250 mA. 
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D.3 Current Requirement for Output Driver Circuitry 

The output driver circuits on the TMS320C30 are required to drive significantly 
higher DC and capacitive loads than internal device logic. Therefore, they are 
designed to drive larger currents than internal devices. Because of this, output 
drivers impose higher supply current requirements than other sections of cir- 
cuitry on the device. 

Accordingly, the highest values of supply current are exhibited when external 
writes are being performed at high speed. During reads, or when the external 
buses are not being used, the TMS320C30 is not driving the data bus; this 
eliminates the most significant component of output buffer current. Further- 
more, in typical cases, only a few address lines are changing, or the whole ad- 
dress bus is static. Under these conditions, an insignificant amount of supply 
current is consumed. Therefore, when no external writes are being performed 
or when writes are performed infrequently, current due to output buffer circuitry 
can be ignored. 

When external writes are being performed, the current required to supply the 
output buffers depends on several considerations. As with internal bus opera- 
tions, current required for output drivers depends on the data being transferred 
and the rate at which transfers are being made. Additionally, output driver cur- 
rent requirements depend on the number of wait states implemented, because 
wait states affect rates at which bus signals switch. Finally, current values are 
also dependent upon external bus DC and capacitive loading. 

External operations involve writes external to the device and constitute the 
major power supply current component. The power supply current for the ex- 
ternal buses is made up of three components and is summarized in the follow- 
ing equation: 

'base + 'prim + 'exp 
where 

•base ,s the 60-mA baseline current component 
'prim ' s ^e primary bus current component 
•exp ' s the expansion bus current component 

The remainder of this section describes in detail the calculation of external bus 
current components. 
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D.3.1 Primary Bus 

The current due to primary bus writes varies roughly exponentially with both 
wait states and write cycle time. Also, current components due to output driver 
circuitry are represented as offsets from the baseline value. Since the baseline 
value is related to internal current components, negative values for current off- 
set are obtained under some circumstances. Note, however, that actual nega- 
tive current does not occur. 

As previously mentioned, to obtain accurate current values, you must first es- 
tablish timing of write cycles on the buses. To determine the rate and timings 
at which write cycles to the external buses occur, you must analyze program 
activity, including any pipeline conflicts that may exist. Information from this 
manual and the TMS320C30 emulator or simulator is useful in making these 
determinations. Note that effects from the use of cache must also be ac- 
counted for in these analyses because use of cache can affect whether in- 
structions are fetched from external memory. 

When evaluating external write activity in a given program segment, you must 
consider whether a particular level of external write activity constitutes signifi- 
cant activity. If writes are being performed at a slow enough rate, they do not 
significantly impact supply current requirements; therefore, current due to ex- 
ternal writes can be ignored. This is the case, however, only if writes are being 
performed at very slow rates on both the primary and the expansion buses. If 
writes are being performed at high speed on only one of the two external 
buses, you should still use the approach described in this section to calculate 
current requirements. 

Note that, although you obtain negative incremental current values under 
some circumstances, the total contribution for external buses, including base- 
line current, must always be positive. The reason is that, when external buses 
are used minimally, total current requirements always approach the current 
contribution due to internal components, which is solely a function of internal 
activity. This places a lower limit on current contributions resulting from the pri- 
mary and expansion buses, because the total current due to external buses 
is the sum of the 60-mA baseline value and the primary and expansion bus 
components. This effect is discussed in further detail in the rest of this subsec- 
tion. 
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When you have established bus-write cycle timing, you can use Figure D-4 
to determine the contribution to supply current due to this bus activity. 
Figure D-4 shows values of current contribution from the primary bus for vari- 
ous numbers of wait states and H1 cycles between writes. These characteris- 
tics are exhibited when writes of alternating 55555555h and AAAAAAAAh are 
being performed at a capacitive load of 80 pF per output signal line. The condi- 
tions exhibit the highest current values on the device. The values presented 
in the figure represent incremental or additional current contributed by the pri- 
mary bus output driver circuitry under the given conditions. Current values ob- 
tained from this graph are later scaled and added to several other current 
terms to calculate the total current for the device. As indicated in the figure, the 
lower curve represents the current contribution for 1 8 or more cycles between 
writes. 



Figure D-4. Primary Bus Current Versus Transfer Rate and Wait States 

Primary Bus Analysis [80 pF, As/5s] 
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Note that number of cycles between writes refers to the number of H1 cycles 
between the active portion of the write cycles a s defined in Chapter 1 3— that 
is, between H1 cycles when STRB, MSTRB, or IOSTRB and R/W (or XR/W, 
as the case may be) are low. As shown in Figure D-4, the minimum number 
of cycles between writes is 1 because with back-to-back writes there is one H1 
cycle between active portions of the writes. 

To further illustrate the relationship of current and write cycle time, Figure D-5 
shows the characteristics of current for various numbers of cycles between 
writes for zero wait states. The information on this curve can be used to obtain 
more precise values of current if zero wait states are being used and the num- 
ber of cycles between writes does not fall on one of the curves in Figure D-4. 
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Figure D-5. Primary Bus Current Versus Transfer Rate at Zero Wait States 

Primary Bus Duty Cycle Analysis [80 pF, As/5s] 
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Note that, although these graphs contain negative current values, negative 
current has not necessarily actually occurred. The negative values exist be- 
cause the graphs represent a current offset from a common baseline current 
value, which is not necessarily the lowest current exhibited. Using this ap- 
proach to depict current contributions due to different components simplifies 
current calculations because it allows calculations to be made independently. 
Independent calculations are possible because information about relation- 
ships between different sections of the device are included implicitly in the in- 
formation for each section. 

Figure D-4 and Figure D-5 show that the contribution of writes for external 
bus activities becomes insignificant if writes are being performed at intervals 
of more than 1 8 cycles. Under these conditions, you should use the incremen- 
tal value of -30-mA current contribution due to the primary bus. Note, however, 
that you should use a value of -30 mA only if the expansion bus is being used 
extensively. This is because the total contribution for external buses, including 
baseline current, must always be positive. If the expansion bus is not being 
used and the primary bus is being used minimally, the current contribution due 
to the primary bus must always be greater than or equal to 20 mA. This ensures 
that the correct total current value is obtained when summing external bus 
components. Once a current value has been obtained from Figure D-4 or 
Figure D-5, this value can, if necessary, be scaled by a data dependency fac- 
tor, as described at the end of this section. This scaled value is then summed 
along with several other current terms to determine the total supply current. 
Calculation of total supply current is described in detail in Section D.4 on page 
D-18. 
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D.3.2 Expansion Bus 

Currents due to the primary and expansion buses are similar in characteristics 
but differ slightly because of several factors, including the fact that the expan- 
sion bus has 11 fewer address outputs than the primary bus (13 rather than 
24). This difference is exhibited in an overall current contribution that is slightly 
lower from the expansion bus than from the primary bus. 

Accordingly, determination of expansion bus current follows the same basic 
premises as determination of the primary bus current. Figure D-6 and 
Figure D-7 show the same current relationships for the expansion bus as 
Figure D-4 and Figure D-5 show for the primary bus. Also, since the total ex- 
ternal buses' current contributions must be positive, if the primary bus is not 
being used and the expansion bus is being used minimally, then the minimum 
current contribution due to the expansion bus is -30 mA. Finally, as with the 
primary bus, current values obtained from these figures may require scaling 
by a data dependency factor, as described in subsection D.3.3 on page D-1 4. 

Figure D-6. Expansion Bus Current Versus Transfer Rate and Wait States 

Expansion Bus Analysis [80 pF, As/5s] 
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Figure D-7. Expansion Bus Current Versus Transfer Rate at Zero Wait States 

Expansion Bus Duty Cycle Analysis [80 pF, As/5s] 
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D.3.3 Data Dependency 

Data dependency of current for the primary and expansion buses is expressed 
as a scale factor that is a percentage of the maximum current exhibited by ei- 
ther of the two buses. Data dependencies for the primary and expansion buses 
are shown in Figure D-8 and Figure D-9, respectively. 

These two figures show normalized weighting factors that you can use to scale 
current requirements on the basis of patterns in data being written on the exter- 
nal buses. The range of possible weighting factors forms a trapezoidal pattern 
bounded by extremes of data values. As can be seen from Figure D-8 and 
Figure D-9, the minimum current is exhibited by writing all 0s, while the maxi- 
mum current occurs when writing alternating 55555555h and AAAAAAAAh. 
This condition results in a weighting factor of 1 , which corresponds to using the 
values from Figure D-4 and/or Figure D-5 directly. 

As with internal bus operations, data dependencies for the external buses are 
well defined, but accurate prediction of data patterns is often either impossible 
or impractical. Therefore, unless you have precise knowledge of data patterns, 
you should use an estimate of a median or average value for scale factor. If 
you assume that data will be neither 5s and As nor all 0s and will be varying 
randomly, a value of 0.85 is appropriate. Otherwise, if you prefer a conserva- 
tive approach, you can use a value of 1 .0 as an upper bound. 
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Regardless of the approach you take for scaling, once you determine the scale 
factors for primary and expansion buses, apply these factors to scale the cur- 
rent values found by using the graphs in the previous two subsections. For ex- 
ample, if a nominal scale factor of 0.85 is used and the system uses zero wait 
states with two cycles between accesses on both the primary and expansion 
buses, the current contribution from the two buses is as follows: 



Primary: 
Expansion: 



0.85 x 80 mA = 68 mA 
0.85 x 40 mA = 34 mA 



Figure D-8. Primary Bus Current Versus Data Complexity Derating Curve 

Primary Bus Data Dependency Analysis [80 pF] 
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Figure D-9. Expansion Bus Current Versus Data Complexity Derating Curve 

Expansion Bus Data Dependency [80 pF] 
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D.3.4 Capacitive Load Dependence 

Once you account for cycle timing and data dependencies, you should include 
capacitive loading effects in a manner similar to that of data dependency. 
Figure D-1 0 shows the scale factor to be applied to the current values 
obtained above as a function of actual load capacitance if the load capacitance 
presented to the buses is less than 80 pF. 

In the previous example, if the load capacitance is 20 pF instead of 80 pF, a 
scale factor of 0.84 is used, yielding: 

Primary: 0.84 x 68 mA = 57. 1 2 mA 
Expansion: 0.84 x 34 mA = 28.56 mA 

The slope of the load capacitance line in Figure D-1 0 is 0.26% normalized Iqd 
per pF. While this slope may be used to interpolate scale factors for loads 
greater than 80 pF, the TMS320C30 is specified to drive output loads of less 
than 80 pF, and interface timings cannot be guaranteed at higher loads. With 
data dependency and capacitive load scale factors applied to the current val- 
ues for primary and expansion buses, the total supply current required for the 
device for a particular application can be calculated, as described in the next 
section. 
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Figure D-10. Current Versus Output Load Capacitance 
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DA Calculation of Total Supply Current 

The previous sections have discussed currents contributed by several 
sources on the TMS320C30. Because determinations of actual current values 
are unique and independent for each source, each current source was dis- 
cussed separately. In an actual application, however, the sum of the indepen- 
dent contributions from each current determines the total current requirement 
for the device. This total current value is exhibited as the total current supplied 
to the device through all of the Vqd inputs and returned through the V§s con- 
nections. 

Note that numerous Vqd an d Vss P' ns on the device are routed to a variety of 
internal connections, not all of which are common. Externally, however, all of 
these pins should be connected in parallel to 5 V and ground planes, respec- 
tively, with as low impedance as possible. 

As mentioned previously, because different program segments inherently per- 
form different operations that are often quite distinct from each other, it is typi- 
cally appropriate to consider current for each of the different segments inde- 
pendently. Once this is done, peak current requirements are readily obtained. 
Further, you can use average current calculations to determine heating effects 
of power dissipation. In turn, you can use these effects to determine thermal 
management considerations. 

D.4.1 Combining Supply Current Due to All Components 

To determine the total supply current requirements for any given program ac- 
tivity, calculate each of the appropriate components and combine them in the 
following sequence: 

1 ) Start with 1 1 0-mA quiescent current requirement. 

2) Add 55 mA for internal operations unless the device is dormant, as during 
execution of IDLE, NOPs, or branches-to-self, or performance of internal 
and/or external bus operations using an RPTS instruction (see subsection 
D.2.2 on page D-5). Internal or external bus operations executed via 
RPTS do not contribute an internal operations power supply current com- 
ponent and hence do not add 55 mA to quiescent current. Therefore, cur- 
rent components in the next two steps might still be required, even though 
the 55 mA is omitted. 
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3) If significant internal bus operations are being performed (see subsection 
D.2.2 on page D-5), add the calculated current value. 

4) If external writes are being performed at high speed (see section D.3 on 
page D-9), add 60 mA and then add the values calculated for primary and 
expansion bus current components. If only one external bus is being used, 
the appropriate incremental current for the unused bus should still be in- 
cluded because the current offsets include components required for oper- 
ating both buses. Note, however, that, as discussed previously, the total 
current contribution for external buses, including baseline, must always be 
positive. 

The current value resulting from summing these components is the total de- 
vice current requirement for a given program activity. 

D.4.2 Supply Voltage, Operating Frequency, and Temperature Dependencies 

Current dependencies specific to each supply current component (such as in- 
ternal or external bus operations) are discussed in subsection D.1 .2 on page 
D-2. Supply voltage level, operating temperature, and operating frequency 
affect requirements for the total supply current and must be maintained within 
required device specifications. 

Once the total current for a particular program segment has been determined, 
the dependencies that affect total current requirements are applied as a scale 
factor in the same manner as data dependencies discussed in other sections. 
Figure D-1 1 shows the relative scale factors to be applied to the supply current 
values as a function of both Vqd and operating frequency. 

Power supply current consumption does not vary significantly with operating 
temperature. However, if desired, a scale factor of 2% normalized Idd per 50°C 
change in operating temperature may be used to derate current within the spe- 
cified range noted in the TMS320C30 data sheet. This temperature depen- 
dence is shown graphically in Figure D-1 2. Note that a temperature scale fac- 
tor of 1 .0 corresponds to current values at 25°C, which is the temperature at 
which all other references in the document are made. 
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Figure D-11. Current Versus Frequency and Supply Voltage 

Iqd Versus f(CLKIN) and Supply Voltage 
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Figure D-12. Current Versus Operating Temperature Change 
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D.4.3 Design Equation 



The procedure for determining the power supply current requirement can be 
summarized in the following equation: 

1 - ('q + 'iops + 'ibus + 'xbus) x FV x T 
where 

l q = 110 mA 

'iops = 55 mA 

'•bus = D 1 x f 1 ( see Table 
'xbus = 'prim + 'exp 
with 

'base = 60 mA 

'prim = D 2 x C 2 x f 2 ( see Table 

'exp = D 3 x C 3 x f 3 ( see Table D ~ 1 ) 

FV is the scale factor for frequency and supply voltage, and 

T is the scale factor for operating temperature. 

Table D-1 describes the symbols used in the power supply current equation. 
The table displays figure numbers from which the value can be obtained. 
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Table D-1. Current Equation Symbols 



Symbol 


Description 


Graph/Value 


'q 


Quiescent Current 


110 mA 


^iops 


Internal Operations Current 


55 mA 


'ibus 


Internal Bus Operations Current 




Di 


Internal Bus Data Scale Factor 


Figure D-3 


fl 


Internal Bus Current Requirement 


Figure D-2 


'xbus 


External Bus Operations Current 




i 

'base 


External Bus Base Current 


60 mA 


i 

'prim 


Primary Bus Operations Current 


+ 

i 


D 2 


Primary Bus Data Scale Factor 


Figure D— 8 


c 2 


Primary Bus Cap Load Scale Factor 


rigure u— iu 


h 


Primary Bus Current Requirement 


Figure D-4 or 
rigure u—o 
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'exp 


expansion dus uperanons v/urrem 


t 


n„ 
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c 3 


Expansion Bus Cap Load Scale Factor 


Figure D-10 


fa 


Expansion Bus Current Requirement 


Figure D-6 or 
Figure D-7 


FV 


Freq/Supply Voltage Scale Factor 


Figure D— 11 


T 


Temperature Scale Factor 


Figure D-1 2 



t See equation in subsection D.4.3 on page D-21 . 

D.4.4 Peak Versus Average Current 

If current is observed over the course of an entire program, some segments 
will usually exhibit significantly different levels of current required for different 
durations of time. For example, a program may spend 80% of its time perform- 
ing internal operations, drawing a current of 250 mA, and spend the remaining 
20% of its time performing writes at full speed to the expansion bus, drawing 
300 mA. 

While knowledge of peak current levels is important in order to establish power 
supply requirements, some applications require information about average 
current. This is particularly significant if periods of high peak current are short 
in duration. Average current can be obtained by performing a weighted sum 
of the currents due to the various independent program segments over time. 
In the example above, the average current can be calculated as follows: 

I = 0.8 x 250 mA + 0.2 x 300 mA = 260 mA 

Using this approach, average current for any number of program segments 

can be calculated. 
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D.4.5 Thermal Management Considerations 



Heating characteristics of the TMS320C30 depend on power dissipation, 
which in turn depends on power supply current. When you make thermal man- 
agement calculations, you must consider the manner in which power supply 
current contributes to power dissipation and to the time constant of the 
TMS320C30 package thermal characteristics. 

Depending on sources and destinations of current on the device, some current 
contributions to Iqd do not constitute a component of power dissipation at 5 
volts. Accordingly, if you use the total current flowing into Vqd to calculate pow- 
er dissipation at 5 volts, you will obtain erroneously large values for power dis- 
sipation. Power dissipation is defined as: 

P = I x V 

(where P is power, I is current, and V is voltage). If device outputs are driving 
any DC load to a logic high level, only a minor contribution is made to power 
dissipation because CMOS outputs typically drive to a level within a few tenths 
of a volt of the power supply rails. If this is the case, subtract these current com- 
ponents out of the total supply current value; then calculate their contribution 
to power dissipation separately and add it to the total power dissipation (see 
Figure D-13). If this is not done, these currents resulting from driving a logic 
high level into a DC load will cause unrealistically high power dissipation val- 
ues. The error occurs because the currents resulting from driving a logic high 
level into a DC load will appear as a portion of the current used to calculate 
power dissipation due to Vqd at 5 volts. 



Figure D-13. Load Currents 
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Furthermore, external loads draw supply-only current when outputs are being 
driven high, because, when outputs are in the logic 0 state, the device is sink- 
ing current that is supplied from an external source. Therefore, the power dissi- 
pation due to this current component will not have a contribution through Iqd 
but will contribute to power dissipation with a magnitude of: 



where Vql is the low-level output voltage and Iql is the current being sunk by 
the output as shown in Figure D-13. The power dissipation component due 
to outputs being driven low should be calculated and added to the total power 
dissipation. 

When outputs with DC loads are being switched, the power dissipation compo- 
nents from outputs being driven high and outputs being driven low are aver- 
aged and added to the total device power dissipation. You should calculate 
power components due to DC loading of the outputs separately for each pro- 
gram segment before you calculate average power. 

Note that any unused inputs that are left disconnected may float to a voltage 
level that will cause input buffer circuits to remain in the linear region and there- 
fore contribute a significant component to power supply current. Accordingly, 
any unused inputs should be made inactive by being either grounded or pulled 
high if absolute minimum power dissipation is desired. If several unused inputs 
must be pulled high, they may be pulled high together through one resistor to 
minimize component count and board space. 

When you use power dissipation values to determine thermal management 
considerations, you should use the average power unless the time duration of 
individual program segments is long. The thermal characteristics of the 
TMS320C30 in the 1 81 -pin grid analysis (PGA) package are exponential in na- 
ture, with a time constant t = 4.5 minutes. Therefore, when subjected to a 
change in power, the temperature of the device package will, after 4.5 minutes, 
reach approximately 63% of the total temperature change. Accordingly, if the 
time duration of program segments exhibiting high power dissipation values 
is short (on the order of a few seconds) , you can use average power, calculated 
in the same manner as average current (as described in subsection D.4.4 on 
page D-22). 

Otherwise, you should calculate maximum device temperature on the basis 
of the actual time duration of the program segments involved. For example, 
if a particular program segment lasts for seven minutes, then, using the expo- 
nential function, you can calculate that a device will reach approximately 80% 
of the temperature due to the total power dissipation during the program seg- 
ment. 
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Note that the average power should be determined by calculating the power 
for each program segment (including considerations described above) and 
performing a time average of these values, rather than simply multiplying the 
average current as determined in the previous subsection by Vqd- 

Specific device temperature calculations are made by using the TMS320C30 
thermal impedance characteristics included in Chapter 13. 
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D.5 Example Supply Current Calculations 

A Fast Fourier Transform (FFT) represents a typical DSP algorithm. The FFT 
code in Section D.8 on page D-30 processes data in the RAM blocks and 
writes the result out to zero-wait-state external SRAM on the primary bus. The 
program executes out of zero-wait-state external SRAM on the primary bus, 
and the TMS320C30's cache is enabled. The entire algorithm consists mainly 
of internal bus operations and so includes quiescent and internal operations 
in general. At the end of processing, the 1 024 results are written out on the pri- 
mary bus. Therefore, the algorithm exhibits a higher current requirement dur- 
ing the write portion, where the external bus is being used significantly. 

D.5.1 Processing 

The processing portion of the algorithm is 95% of the total algorithm. During 
this portion, the power supply current is required only for the internal circuitry. 
Data is processed in several loops that compose a majority of the algorithm. 
During these loops, two operands are transferred on every cycle. The current 
required for internal bus usage, then, is 55 mA, taken from Figure D-2 on page 
D-7. The data is assumed to be random. A data value scale factor of 0.8 is 
used from Figure D-3 on page D-7. This value scales 55 mA, yielding 44 mA 
for internal bus operations. Adding 44 mA to the quiescent current requirement 
and internal operations current requirement yields a current requirement of 
209 mA for the major portion of the algorithm. 

1 = 'q + 'iops + 'ibus 

I = 110 mA + 55 mA + (55 mA)(0.8) = 209 mA 

D.5.2 Data Output 

The portion of the algorithm corresponding to writing out data is approximately 
5% of the total algorithm. Again, the data that is being written is assumed to 
be random. From Figure D-3 on page D-7 and Figure D-8 on page D-15, 
scale factors of 0.80 and 0.85 are used for derating due to data value depen- 
dency for internal and primary buses, respectively. During the data dump por- 
tion of the code, a load and store are performed every cycle; however, the par- 
allel load/store instruction is in an RPTS loop, so there is no contribution due 
to internal operations, because the instruction is fetched only once. The only 
internal contributions are due to quiescent and internal bus operations. 
Figure D-4on page D-11 indicates a 1 70-mA current contribution due to back- 
to-back zero-wait-state writes, and Figure D-6 on page D-13 indicates a 
-80-mA contribution due to the expansion bus being idle (that is, with more 
than 1 8 H1 cycles between writes). Therefore, the total contribution due to this 
portion of the code is: 4 
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1 = 'q + 'ibus + 'xbus 
or, 

I = 110 + (55 mA)(0.8) + 60 mA - 80 mA + (170 mA)(0.85) 
= 278.5 mA 

D.5.3 Average Current 

The average current is derived from the two portions of the algorithm. The pro- 
cessing portion took 95% of the time and required about 21 0 mA, and the data 
dump portion took the other 5% and required about 280 mA. The average is 
calculated as: 

l avg = (0.95)(21 mA) + (0.05)(280 mA) = 213.5 mA 

From the thermal characteristics specified in Chapter 1 3, it can be shown that 
this current level corresponds to a case temperature of 43°C. This temperature 
meets the maximum device specification of 85°C and hence requires no 
forced air cooling. 

D.5.4 Experimental Results 

A photograph of the power supply current for the FFT is shown Section D.7 on 
page D-29. During the FFT processing, the measured current varied between 
1 80 and 220 mA. The peak of the current during external writes was 270 mA, 
and the average current requirement, as measured on a digital multimeter, 
was 200 mA. The calculations yielded results that were extremely close to the 
actual measured power supply current. 
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D.6 Summary 

An accurate power supply current requirement for the TMS320C30 cannot be 
expressed simply in terms of operating frequency, supply voltage, and output 
load capacitance. The specification must be more complete and depends on 
device functionality and system parameters. The current components related 
to device functionality are due to quiescent current, internal operations, inter- 
nal bus operations, and external bus operations. Those related to system pa- 
rameters are due to operating frequency, supply voltage, output load capaci- 
tance, and operating temperature. The typical power supply current require- 
ment is 200 mA, and the minimum, or quiescent, is 110 mA. 

This application report presents information required to determine power sup- 
ply specifications. Specifications are based on an algorithm's use of internal 
and external buses on the TMS320C30. As devices become more complex, 
the caicuiation of power dissipation becomes more critical. 



The maximum current requirement is 600 mA and occurs only 
under worst case conditions; writing alternating data 
(AAAAAAAAh to 55555555h) out of both external buses 
simultaneously every cycle, with 80 pF loads and running at 33 
MHz. 
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D.7 Photo of l DD for FFT 




500 ixs/Div 

Input Clock Frequency = 33 MHz 
Voltage Level = 5.0 Vqd 
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D.8 FFT Assembly Code 



• GLOBL 
• GLOBL 
•GLOBL 
•GLOBL 



SINTABs 



RAMO : 



• WORD 



• WORD 



OUTBUF: 

• WORD 

• TEXT 
FFT: LDP 



FFT 

N 

M 

SINE 



SINE 

809800h 

800h 

SINTAB 



; setup 



; processing portion: 

; quiescent , internal and 

? bus operations 



LDI N,IR0 
LSH -1,IR0 

} LENGTH— TWO BUTTERFLIES 

LDI @RAM0,AR0 
LDI IR0,RC 
SUBI 1,RC 

RPTB BLK1 

ADDF *+AR0,*AR0++,R0 

SUBF *AR0,*-AR0,R1 
BLK1 STF R0,*-AR0 
| | STF R1,*AR0++ 

; FIRST PASS OF THE DO-20 LOOP (STAGE K=2 IN DO-10 LOOP) 



LDI 


@RAM0 , ARO 


LDI 


2, IRO 


LDI 


N,RC 


LSH 


-2,RC 


SUBI 


1,RC 


RPTB 


BLK2 


ADDF 


*+AR0 ( IRO ) , *AR0++ ( IRO ) , RO 


SUBF 


*AR0,*-ARQ(IR0) ,Rl 


NEGF 


*+AR0,R0 


STF 


RO , *-AR0 ( IRO ) 


STF 


R1,*AR0++(IR0) 


STF 


R0,*+AR0 



? MAIN LOOP (FFT STAGES) 
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LDI 
LSH 
LDI 
LDI 
LDI 

LOOP LSH 
LSH 
LSH 



N, IRO 
-2, IRO 
3,R5 

1, R4 

2, R3 
—1 , IRO 
1,R4 
1,R3 



INNER LOOP (DO-20 LOOP IN THE PROGRAM) 
@RAM0 , AR5 



INLOP5 



II 



LDI 

LDI 

ADD I 

LDI 

LDI 

ADD I 

LDI 

ADD I 

LDI 

SUBI 

ADD I 

LDF 

ADDF 

SUBF 

STF 

NEGF 

NEGF 

STF 

STF 



IRO, ARO 
@SINTAB, ARO 
R4,IR1 
AR5,AR1 

1, ARl 
ARl , AR3 
R3,AR3 
AR3,AR2 

2 , AR2 

R3,AR2,AR4 
*AR5++(IRl) ,R0 
*+AR5(IRl) ,R0,R1 
R0,*++AR5(IR1) ,R0 
R1,*-AR5(IR1) 
RO 

*++AR5(IRl) ,R1 

R0,*AR5 

R1,*AR5 



INNERMOST LOOP 



II 

II 
II 



LDI 


N,IR1 


LSH 


-2,IR1 


LDI 


R4,RC 


SUBI 


2,RC 


RPTB 


BLK3 


MPYF 


*AR3, *+AR0 (IR1) ,RQ 


MPYF 


*AR4,*AR0,R1 


MPYF 


*AR4,*+AR0(IR1) ,Rl 


ADDF 


R0,Rl,R2 


MPYF 


*AR3 , *AR0++ ( IRO ) , RO 


SUBF 


R0,Rl,R0 


SUBF 


*AR2,R0,R1 


ADDF 


*AR2,R0,R1 


STF 


Rl,*AR3++ 


ADDF 


*AR1,R2,R1 


STF 


Rl,*AR4 


SUBF 


R2 f *AR1 „ Rl 
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ii 

BLK3 



DUMP 



II 



STF 


Rl,*ARl++ 


STF 


R1,*AR2 


SUBI 


@RAM0,AR5 


ADD I 


R4,AR5 


CMP I 


N,AR5 


BLTD 


INLOP 


ADDI 


@RAM0,AR5 


NOP 




NOP 




ADDI 


1,R5 


CMPI 


M,R5 


BLE 


LOOP 


LDI 


@RAM0 r ARO 


LDI 


@OUTBUF,AP 


LDF 


*AR0++,R0 


RPTS 


N-2 


LDF 


*AR0++,RQ 


STF 


RO, *ARl++ 


STF 


R0,*AR1++ 


LDI 


RAMO , ARl 


LDI 


@RAM0 , ARO 


XOR 


400h,AR0 


STI 


AR0,*AR1 


B 


FFT 


• END 





; data dump portion 

; quiescent, internal bus 

; ops and primary bus ops 



; swap RAM banks 
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SMJ320C3x Digital Signal Processor 

Data Sheet 



This appendix contains the standalone data sheet for the military version of the 
'C3x digital signal processor, the SMJ320C3x Digital Signal Processor. 
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SMJ320C3X 
DIGITAL SIGNAL PROCESSOR 

SGUS014A- FEBRUARY 1991 - REVISED SEPTEMBER 1994 



• Processed to MIL-STD-883, Class B 

• Operating Temperature Range: 
-55°C to125°C 

• Two 1K x 32-Bit Single-Cycle Dual-Access 
On-Chip RAM Blocks 

• Validated ADA Compiler 

• 64-Word x 32-Bit Instruction Cache 

• 32-Bit Instruction and Data Words, 24-Bit 
Addresses 

• 40/32-Bit Floating-Point /Integer Multiplier 
and ALU 

• Parallel ALU and Multiplier Execution in a 
Single Cycle 

• On-Chip Direct Memory Access (DMA) 
Controller for Concurrent I / O and CPU 
Operation 

• Integer, Floating-Point, and Logical 
Operations 

SMJ320C30 Key Features 

• Performance 

- SMJ320C30-33 (60-ns Cycle) 

33 MFLOPS 
16.7 MIPS 

- SMJ320C30-28 (70-ns Cycle) 

28.6 MFLOPS 
14.3 MIPS 

• One 4K x 32-Bit Single-Cycle Dual-Access 
On-Chip ROM Block 

• Two 32-Bit External Ports (24- and 13-Bit 
Address) 

• Two Serial Ports With Support for 
8 / 1 6 /24 /32-Bit Transfers 

• Two 32-Bit Timers 

• Packaging 

- 181 -Pin Grid Array Ceramic Package 
(GB Suffix) 

- 196-Pin Quad Flat Pack With 
Nonconductive Tie-Bar (HFG Suffix) 

- 244-Pad JEDEC Standard TAB Frame 

• SMD Approval for 28- and 33-MHz Versions 



Two Address Generators With Eight 
Auxiliary Registers and Two Auxiliary 
Register Arithmetic Units 
Zero-Overhead Loops With Single-Cycle 
Branches 

Interlocked Instructions for 
Multiprocessing Support 

32-Bit Barrel Shifter 

Eight Extended-Precision Registers 

(Accumulators) 

Two- and Three-Operand Instructions 
Conditional Calls and Returns 
Block Repeat Capability 
0.8-nm EPIC™ CMOS Technology 



SMJ320C31 Key Features 

• Performance 

- SMJ320C31 -40 (50-ns Cycle) 

40 MFLOPS 
20 MIPS 

- SMJ320C31-33 (60-ns Cycle) 

33.3 MFLOPS 
16.7 MIPS 

- SMJ320C31-27(74-ns Cycle) 

27 MFLOPS 
13.5 MIPS 
9 Flexible Boot Program Loader 

• One Serial Port to Support 
8 / 1 6 /24 /3 2-Bit Transfers 

• One 32-Bit Data Bus (24-Bit Address) 

• Packaging 

- 132-Pin Ceramic Quad Flat Pack With 
Nonconductive Tie-Bar (HFG Suffix) 

- 141-Pin Staggered Grid Array 
(GFA Suffix) 

- 244-Pad JEDEC-Standard TAB Frame 

• SMD Approval for 27- and 33-MHz Versions 



EPIC 8s a trademark of Texas Instruments Incorporated. 



PRODUCTION DATA Information It current at of publication date. — Copyright © 1 994, Texas Instruments Incorporated 

Products conform to epeclficatlont per the terms of Texas Instruments w Sm 

^jf Texas 
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standard warranty. Production processing does not necessarily include 
testing of all parameters. 
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SMJ320C30 . . . GB PACKAGE 
(BOTTOM VIEW) 



SMJ320C31 . . . GFA PACKAGE 
(BOTTOM VIEW) 
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SMJ320C30 . . . HFG PACKAGE 
(TOP VIEW) 



SMJ320C31 . . . HFG PACKAGE 
(TOP VIEW) 





NOTE: Refer to mechanical data section for TAB drawing. 
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description 



The SMJ320C3x's internal busing and special digital signal processing (DSP) instruction set have the speed 
and flexibility to execute up to 33 MFLOPS (million floating-point operations per second). The SMJ320C3x 
optimizes speed by implementing functions in hardware that other processors implement through software or 
microcode. This hardware-intensive approach provides performance previously unavailable on a single chip. 

The emphasis on total system cost has resulted in a less expensive processor that can be designed into systems 
currently using costly bit-slice processors. Also, appropriate selection based on cost and performance is 
enhanced by the different processors in the SMJ320C3x line: 

• SMJ320C30-33: 60-ns single-cycle execution time, 10% supply 

• SMJ320C30-28: 70-ns single-cycle execution time, 5% supply 

• SMJ320C31-40: Low cost, reduced overall size, 50-ns single-cycle execution time, 10% supply 

• SMJ320C31-33: Low cost, reduced overall size, 60-ns single-cycle execution time, 10% supply 

• SMJ320C31-27: Low cost, reduced overall size, 74-ns single-cycle execution time, 10% supply 

The SM J320C30 and SM J320C31 can perform parallel multiply and ALU operations on integer or floating-point 
data in a single cycle. Each processor also possesses a general-purpose register file, a program cache, 
dedicated auxiliary register arithmetic units (ARAU), internal dual-access memories, one DMA channel 
supporting concurrent I/O, and a short machine-cycle time. High performance and ease of use are results of 
these features. 

General-purpose applications are greatly enhanced by the large address space, multiprocessor interface, 
internally and externally generated wait states, external interface ports (two on the SMJ320C30, one on the 
SMJ320C31), two timers, serial ports (two on the SMJ320C30, one on the SMJ320C31), and multiple interrupt 
structure. The SMJ320C3x supports a wide variety of system applications from host processor to dedicated 
coprocessor. 

High-level language support is easily implemented through a register-based architecture, large address space, 
powerful addressing modes, flexible instruction set, and well-supported floating-point arithmetic. 
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functional block diagram 



f y ROM ^ 
O Block / 
'j (4KX32) / 




INT (3-0) - 
IACK« 
MC/MP- 
XF(1,0)< 
V DD (3-0)- 
IODV DD (1,0)— > 
ADV DD (1,0)- 
PDV DD - 
DDV DD (1,0)- 
MDV DD - 
VSSO-O)- 
DV S s(3-0) -+ 
CVssO.0)-* 

ivss- 
Vbbp< 

SUBS- 

X2/CLKIN - 
H14 
H3< 
EMU(6-0)«+ 
RSV(1 0-0)4-+. 



Available on SMJ320C30 
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SMJ320C30 Terminal Assignment 



TERMINAL 


TERMINAL 


TERMINAL 


TERMINAL 


TERMINAL 


NUMBER 




NUMBER 




NUMBER 




NUMBER 




NUMBER 




GB 


HFG 


NAME 


GB 


HFG 


NAME 


GB 


HFG 


NAME 


GB 


HFG 


NAME 


GB 


HFG 


NAME 


PKG 


PKG 




PKG 


PKG 




PKG 


PKG 




PKG 


PKG 




PKG 


PKG 




F15 


82 


AO 


C4 


144 


DO 


R2 


7 


FSX0 


B13 


103 


XA4 


N6 


14 


XD2 


G12 


81 


A1 


D5 


143 


D1 


N4 


5 


CLKR0 


A15 


102 


XA5 


R5 


17 


XD3 


G13 


80 


A2 


A2 


142 


D2 


M5 


6 


CLKX0 


B15 


95 


XA6 


P6 


18 


XD4 


G14 


79 


A3 


A3 


141 


D3 


R1 


3 


DRO 


C14 


94 


XA7 


M7 


19 


XD5 


G15 


78 


A4 


B4 


140 


D4 


R3 


8 


DXO 


E12 


93 


XA8 


R6 


20 


XD6 


H15 


77 


A5 


C5 


139 


D5 


M3 


191 


FSR1 


D13 


92 


XA9 


N7 


21 


XD7 


H14 


72 


A6 


D6 


138 


D6 


P1 


194 


FSX1 


C15 


91 


XA10 


P7 


22 


XD8 


J15 


71 


A7 


A4 


137 


D7 


L4 


192 


CLKR1 


D14 


90 


XA11 


R7 


23 


XD9 


J14 


70 


A8 


B5 


136 


D8 


N2 


193 


CLKX1 


E13 


89 


XA12 


P8 


24 


XD10 


J13 


69 


A9 


C6 


135 


D9 


N1 


190 


DR1 


J3 


179 


RSVO 


R8 


29 


XD11 


K15 


68 


A10 


A5 


134 


D10 


P2 


195 


DX1 


J4 


180 


RSV1 


R9 


30 


XD12 


J12 


67 


A11 


B6 


133 


D11 


F14 


83 


EMUO 


K1 


181 


RSV2 


P9 


31 


XD13 


K14 


66 


A12 


D7 


132 


D12 


E15 


84 


EMU1 


K2 


182 


RSV3 


N9 


32 


XD14 


L15 


65 


A13 


A6 


131 


D13 


F13 


85 


EMU2 


L1 


183 


RSV4 


R10 


33 


XD15 


K13 


63 


A14 


C7 


130 


D14 


E14 


86 


EMU3 


K3 


184 


RSV5 


M9 


34 


XD16 


L14 


62 


A15 


B7 


129 


D15 


F12 


87 


EMU4/SHZ 


L2 


185 


RSV6 


P10 


35 


XD17 


M15 


61 


A16 


A7 


128 


D16 


C1 


155 


EMU5 


K4 


186 


RSV7 


R11 


36 


XD18 


K12 


60 


A17 


A8 


127 


D17 


M6 


11 


EMU6 


M1 


187 


RSV8 


N10 


37 


XD19 


L13 


59 


A18 


B8 


122 


D18 


B3 


145 


H1 


L3 


188 


RSV9 


P11 


38 


XD20 


M14 


58 


A19 


A9 


121 


D19 


A1 


146 


H3 


M2 


189 


RSV10 


R12 


39 


XD21 


N15 


57 


A20 


B9 


120 


D20 


C2 


152 


X1 


D12 


100 


ADV DD 


M10 


40 


XD22 


M13 


56 


A21 


C9 


119 


D21 


B1 


151 


X2/CLKIN 


H11 


64 


ADV DD 


N11 


41 


XD23 


L12 


55 


A22 


A10 


118 


D22 


P4 


9 


TCLKO 


D4 


114 


DDV D d 


P12 


42 


XD24 


N14 


54 


A23 


D9 


117 


D23 


N5 


10 


TCLK1 


E8 


147 


DDV DD 


R13 


43 


XD25 


E5 




LOCATOR 


B10 


116 


D24 


G2 


169 


XFO 


L8 


15 


IUDVdd 


R14 


44 


AU£D 


G1 


170 


IACK 


A11 


115 


D25 


G3 


168 


XF1 


M12 


16 


IODV DD 


M11 


45 


XD27 


H2 


171 


Into 


C10 


113 


D26 


D3 


154 


VBBP 




49 


iodv dd 


N12 


46 


XD28 


H1 


176 


TntT 


B11 


112 


D27 


E4 


153 


VSUBS 


H5 


162 


MDV DD 


P13 


47 


XD29 


J1 


177 


INT2 


A12 


111 


D28 


H4 


123 


vdd 




163 


MDVqd 


R15 


48 


XD30 


J2 


178 


INT3 


D10 


110 


D29 


D8 


73 


v D d 


M4 


1 


PDV DD 


P15 


53 


XD31 


D15 


88 


MC/MP 


C11 


109 


D30 


M8 


74 


vdd 


B2 


51 


cv ss 


C3 


50 


DV S S 


E3 


157 


MSTRB 


B12 


108 


D31 


H12 


124 


vdd 


P14 


52 


cv ss 


C13 


98 


DV S s 


E1 


164 


RDY 


F3 


161 


HOLD 


N8 


27 


vss 


C8 


28 


vss 


N3 


148 


DVss 


F1 


167 


RESET 


E2 


160 


HOLDA 


A13 


107 


XAO 


H3 


75 


vss 


N13 


196 


DV S S 


G4 


166 


R/W 


D2 


156 


XRDY 


A14 


106 


XA1 


H13 


76 


vss 


B14 


96 


ivss 


F2 


165 


STRB 


D1 


159 


XR/W 


D11 


105 


XA2 


R4 


12 


XDO 




97 


ivss 


F4 


158 


IOSTRB 


P3 


4 


FSR0 


C12 


104 


XA3 


P5 


13 


XD1 









NOTES: 1 . ADVqd. DDVqd. IODVdd. MDVqd. and PDVqd are on a common plane internal to the device. 

2. Vdd ' s on a common plane internal to the device. 

3. Vss. cV ss. and IVss are on a common plane internal to the device. 

4. DVss is on a common plane internal to the device. 
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SMJ320C31 Terminal Assignments 



TERMINAL 


TERMINAL 


TERMINAL 


TERMINAL 


NUMBER 




NUMBER 




NUMBER 




NUMBER 




HFG 


GFA 


NAME 


HFG 


GFA 


NAME 


HFG 


GFA 


NAME 


HFG 


GFA 


NAME 


DtCCt 
ftW> 






Died 












DtCfi 
rfxVJi 






12 


L1 


AO 


47 


W9 


D10 


86 


E19 


INT1 


18 


P4 


vssl 


11 


K2 


A1 


46 


U9 


D11 


89 


F18 


INT2 


19 


T10 


vssl 


10 


J1 


A2 


45 


V8 


D12 


90 


G17 


INT3 


20 


K4 


DVss 


9 


J3 


A3 


43 


W7 


D13 


110 


C11 


MCBL/MP 


25 


T4 


ivss 


8 


G1 


A4 


41 


U7 


D14 


77 


L19 


R/W 


34 


G3 


DVss 


6 


F2 


A5 


39 


V6 


D15 


75 


N17 


RDY 


40 


K16 


CVss 


5 


E1 


A6 


38 


W5 


D16 


78 


K18 


RESET 


44 


T8 


ivss 


4 


E3 


A7 


37 


U5 


D17 


101 


A17 


SHZ 


52 


T12 


DV S s 


3 


D2 


A8 


36 


V4 


D18 


76 


M18 


STRB 


53 


R11 


vssl 


1 


C1 


A9 


35 


W3 


D19 


103 


B16 


TCLKO 


54 


J15 


vssl 


131 


C3 


A10 


33 


U3 


D20 


105 


C15 


TCLK1 


67 


W13 


DVss 


129 


B2 


A11 


31 


V2 


D21 


121 


G5 


av D d 


68 


D10 


CVss 


128 


A1 


A12 


30 


W1 


D22 


130 


E7 


A\/r>r> 
• UU 


69 


D16 


IV/r»r» 
■ * OO 


127 


C5 


A13 


29 


R3 


D23 


7 


E5 


av D d 


84 


T16 


DVss 


126 


B4 


A14 


28 


T2 


D24 


15 


N5 


Vddl 


85 


D12 


vssl 


125 


A3 


A15 


27 


U1 


D25 


16 


R5 


Vddl 


92 


F16 


CVss 


124 


C7 


A16 


26 


N3 


D26 


23 


H4 


dvqd 


96 


H16 


ivss 


123 


B6 


A17 


24 


P2 


D27 


32 


J5 


DVqd 


100 


D14 


VSUBS 


122 


C9 


A18 


22 


R1 


D28 


42 


T14 


DVqd 


102 


U15 


DVss 


120 


B8 


A19 


21 


L3 


D29 


48 


R7 


Vddl 


111 


C13 


CVss 


117 


A7 


A20 


17 


M2 


D30 


49 


R9 


Vddl 


71 


1 IO 


X1 


116 


A9 


A21 


14 


N1 


UO 1 


57 


R13 


n\/r^r^ 


70 


u 1 y 


yo/CA KIN 


113 


B10 


A 99 


91 


C19 


nnn 
unu 


66 


R15 


n\/r>p> 


79 


J19 


XFO 


112 


A11 


A23 


99 


C17 


DXO 


74 


P16 


CVqd 


81 


G19 


XF1 


94 


E17 


CLKRO 


107 


B14 


EMUO 


80 


N15 


CVqd 




F6 


No Connect 


95 


A19 


CLKXO 


108 


A13 


EMU1 


87 


G15 


Vddl 




D4 


DVss 


63 


W19 


DO 


109 


B12 


EMU2 


88 


E15 


vddl 




N19 


DVss 


62 


V16 


D1 


106 


A15 


EMU3 


98 


L15 


PVnn 




R17 


DVqq 


61 


W17 


D2 


93 


D18 


FSRO 


104 


E9 


pvdd 




L17 


DVss 


60 


U13 


D3 


97 


B18 


FSXO 


114 


E13 


Vddl 




M16 


DVss 


59 


V14 


D4 


73 


P18 


HOLD 


115 


E11 


Vddl 




D6 


DV S s 


58 


W15 


D5 


72 


R19 


HOLDA 


118 


L5 


v ssl 




A5 


DV S s 


56 


U11 


D6 


64 


V18 


H1 


119 


H2 


DV S S 




D8 


DVss 


55 


V12 


D7 


65 


U17 


H3 


132 


M4 


cvss 








51 


W11 


D8 


82 


H18 


IACK 


2 


F4 


DV S S 








50 


V10 


D9 


83 


J17 


INTO 


13 


T6 


cv S s 









NOTES: 5. CVss. V SSL» ,V SS are on tne sarr >e plane. 

6. AVqd. DVqd. CVqd. PVqd are on the same plane. 

7. VsuBS connects to die metallization. Tie this terminal to clean ground. 
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terminal functions 

This section gives signal descriptions for the SM J320C3x devices in the microprocessor mode. The following 
tables list each signal, the number of terminals, and type of operating mode(s) (i.e., input, output, or 
high-impedance state as indicated by I, O, or Z), and a brief function description. All terminals labele d NC are 
special functions of the device and should not be connected by the user. A line over a signal name (e.g., RESET) 
indicates that the signal is active low (true at logic 0 level). The signals are grouped according to function. 



SMJ320C30 Terminal Functions 



TERMINAL 
NAME QTY 


TYPEt 


DESCRIPTION 


CONDITIONS 
WHEN 
SIGNAL IS Z TYPE* 


PRIMARY BUS INTERFACE 


D31 - DO 32 


l/O/Z 


32-bit data port of the primary bus interface 


S H 


A23-A0 24 


o/z 


24-bit address port of the primary bus interface 


S H R 


R/W 1 


o/z 


Read/write for primary bus interface. R/W is high when a read is performed and low 
when a write is performed over the parallel interface. 


S H R 


STRB 1 


o/z 


External access strobe for the primary bus interface 


S H 


RDY 1 


I 


Ready. RDY indicates that the external device is prepared for a primary-bus-interface 
transaction to complete. 


S 


HOLD 1 


I 


Hold for primary bus interface. When HOLD is a logic low, any ongoing transaction 
is completed. A23-A0, D31 -DO, STRB, and R/W are in the high-impedance state 
and all transactions over the primary bus interface are held until HOLD becomes a 
logic high or the NOHOLD bit of the primary-bus-control register is set. 




HOLDA 1 


o/z 


Hold acknowledge for primary bus interface. HOLDA is generated in response to a 
logic low on HOLD. HOLDA indicates that A23-A0, D31 -DO, STRB, and R/W are 
in the high-impedance state and that all transactions over the bus are held. HOLDA 
is high in response to a logic high of HOLD or when the NOHOLD bit of the 
primary-bus-control register is set. 


S 


EXPANSION BUS INTERFACE 


XD31-XD0 32 


l/O/Z 


32-bit data port of the expansion bus interface 


S R 


XA12-XA0 13 


o/z 


1 3-bit address port of the expansion bus interface 


S R 


XR/W 1 


o/z 


Read/write signal for expansion bus interface. When a read is performed, XR/W is 
held high; when a write is performed, XR/W is low. 


S R 


MSTRB 1 


o/z 


External memory access strobe for the expansion bus interface 


S 


IOSTRB 1 


o/z 


External I/O access strobe for the expansion bus interface 


S 


XRDY 1 


I 


Ready signal. XRDY indicates that the external device is prepared for an expansion- 
bus-interface transaction to complete. 




CONTROL SIGNALS 


RESET 1 


I 


Reset. When RESET is a logic low, the device is in the reset condition. When RESET 
becomes a logic high, execution begins from the location specified by the reset vector. 




INT3-INT0 4 


I 


External interrupts 




IACK 1 


o/z 


Interrupt acknowledge. IACK is set to a logic high by the IACK instruction. This signal 
can be used to indicate the beginning or end of an interrupt-service routine. 


S 


MC/MP 1 


I 


Microcomputer/ microprocessor mode 




XF1.XF0 2 


l/O/Z 


External flags. XF1 and XFO are used as general-purpose l/Os or to support 
interlocked processor instructions. 


S R 



1 1 = input, 0 = output, Z = high-impedance state 

* S = SHZ active, H = HOLD active, R = RESET active 
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SMJ320C30 Terminal Functions (Continued) 



TERMINAL 


TYPEt 




CONDITIONS 


NAME 


QTY 


DESCRIPTION 


WHEN 
SIGNAL IS Z TYPE* 


SERIAL PORT 0 SIGNALS 


CLKXO 1 


l/O/Z 


Serial port 0 transmit clock. CLKXO is the serial shift clock for the serial port 0 
transmitter. 


S 


R 


DXO 1 


l/O/Z 


Data transmit output. Serial port 0 transmits serial data on DXO. 


S 


R 


FSXO 1 


l/O/Z 


Frame synchronization pulse for transmit. The FSXO pulse initiates the transmit data 
process over DXO. 


S 


R 


CLKRO 1 


l/O/Z 


Serial port 0 receive clock. CLKRO is the serial shift clock for the serial port 0 receiver. 


S 


R 


DRO 1 


l/O/Z 


Data receive. Serial port 0 receives serial data via DRO. 


S 


R 


FSRO 1 


l/O/Z 


Frame synchronization pulse for receive. The FSRO pulse initiates the receive data 


S 


R 


process over DRO. 






SERIAL PORT 1 SIGNALS 


CLKX1 1 


l/O/Z 


Serial port 1 transmit clock. CLKX1 is the serial shift clock for the serial port 1 
transmitter. 


S 


R 


DX1 1 


l/O/Z 


Data transmit output. Serial port 1 transmits serial data on DX1 . 


S 


R 


FSX1 1 


l/O/Z 


Frame synchronization pulse for transmit. The FSX1 pulse initiates the transmit data 
process over DX1 . 


S 


R 


CLKR1 1 


l/O/Z 


Serial port 1 receive clock. CLKR1 is the serial shift clock for the serial port 1 receiver. 


S 


R 


DR1 1 


l/O/Z 


Data receive. Serial port 1 receives serial data via DR1 . 


S 


R 


FSR1 1 


l/O/Z 


Frame synchronization pulse for receive. The FSR1 pulse initiates the receive data 
process over DR1. 


S 


R 


TIMER 0 SIGNALS 


TCLKO 1 


l/O/Z 


Timer clock 0. As an input, TCLKO is used by timer 0 to count external pulses. As an 






output, TCLKO outputs pulses generated by timer 0. 


S 


R 


TIMER 1 SIGNALS 


TCLK1 1 


l/O/Z 


Timer clock 1 . As an input, TCLK1 is used by timer 1 to count external pulses. As an 






output, TCLK1 outputs pulses generated by timer 1 . 


S 


R 


SUPPLY AND OSCILLATOR SIGNALS 


V D D 


4 




5-V supply§ 




iodv dd 


2 




5-V supply§ 




ADV DD 


2 




5-V supply§ 




PDV DD 


1 




5-V supply§ 




DDV DD 


2 




5-V supply§ 




mdv dd 


1 




5-V supply§ 




v S s 


4 




Ground 




DV S S 


4 




Ground 




cv S s 


2 




Ground 





1 1 = input, O = output, Z = high-impedance state 

* S = SHZ active, H = HOLD active, R = RESET active 

§ Recommended decoupling capacitor is 0.1 jaF. 
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SMJ320C30 Terminal Functions (Continued) 



TERMINAL 
NAME QTY 


TYPEt 


DESCRIPTION 


CONDITIONS 
WHEN 
SIGNAL IS Z TYPE* 


SUPPLY AND OSCILLATOR SIGNALS (CONTINUED) (see Note 5) 


iv S s 1 


I 


Ground 




Vbbp 1 


NC 


Vbb pump oscillator output 




VSUBS 1 


I 


Substrate terminal. Tie to ground. 




X1 1 


o/z 


Output from the internal oscillator for the crystal. If a crystal is not used, X1 should 
be left unconnected. 


S 


X2/CLKIN 1 


I 


Input to the internal oscillator from the crystal or a clock 




H1 1 


O/Z 


External H1 clock. H1 has a period equal to twice CLKIN. 


S 


H3 1 


o/z 


External H3 clock. H3 has a period equal to twice CLKIN. 


S 


RESERVED^ 


EMU0-EMU2 3 


I 


Reserved. Use pullup resistors to 5 V. 




EMU3 1 


o/z 


Reserved 


s 


EMU4/SHZ 1 


I 


Shutdown high impedance. When active, EMU4/SHZ shuts down the SMJ320C30 
and places all terminals in the high-impedance state. EMU4/SHZ is used for 
board-level testing to ensure that no dual drive conditions occur. CAUTION: A low 
on SHZ corrupts SMJ320C30 memory and register contents. Reset the device with 
SHZ high to restore it to a known operating condition. 




EMU5, EMU6 2 


NC 


Reserved 




RSV0-RSV10 11 


I 


Reserved. Use pullup resistors to 5 V. 




Locator 1 


NC 


Reserved 





1 1 = input, O = output, Z = high-impedance state 

* S = SHZ active, H = HOLD active, R = RESET active 

§ Follow the connections specified for the reserved terminals. Use 18-kQ-22-kQ pullup resistors for best results. All 5-V supply terminals must 
be connected to a common supply plane, and all ground terminals must be connected to a common ground plane. 
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SMJ320C31 Terminal Functions 



TERMINAL 
NAME QTY 


TYPEt 


DESCRIPTION 


CONDITIONS 
WHEN 
SIGNAL IS Z TYPE* 


PRIMARY BUS INTERFACE 


D31-D0 


32 


l/O/Z 


32-bit data port 


S 


H R 


A23-A0 


24 


o/z 


24-bit address port 


S 


H R 


R/W 


1 


o/z 


Read/write. R/W is high when a read is performed and low when a write is performed 
over the parallel interface. 


S 


H R 


STRB 


1 


o/z 


External access strobe 


S 


H 


RDY 


1 


I 


Ready. RDY indicates that the external device is prepared for a transaction 
completion. 




HOLD 


1 


I 


Hold. When HOLD is a logic low, any ongoing transaction is completed. A23-A0, 
D31 -DO, STRB, and R/W are in the high-impedance state and all transactions over 
the primary bus interface are held until HOLD becomes a logic high or the NOHOLD 
bit of the primary-bus-control register being set. 




HOLDA 


1 


o/z 


Hold acknowledge. HOLDA is generated in response to a logic low on HOLD. HOLDA 
indicates that A23-A0, D31 -DO, STRB, and R/W are in the high-impedance state 
and that all transactions over the bus are held. HOLDA is high in response to a logic 
high of HOLD or the NOHOLD bit of the primary-bus-control register being set. 


S 


CONTROL SIGNALS 


RESET 


1 


I 


Reset. When RESET is a logic low, the device is in the reset condition. When RESET 
becomes a logic high, execution begins from the location specified by the reset vector. 




INT3-INT0 


4 


I 


External interrupts 




IACK 


1 


o/z 


Interrupt acknowledge. IACK is set to a logic high by the IACK instruction. This signal 
can be used to indicate the beginning or end of an interrupt-service routine. 


S 


MCBL/MP 


1 


I 


Microcomputer boot loader/microprocessor mode select 




SHZ 


1 


I 


Shutdown high impedance. When active, SHZ shuts down the SMJ320C31 and 
places all terminals in the high-impedance state. SHZ is used for board-level testing 
to ensure that no dual drive conditions occur. CAUTION: A low on SHZ corrupts 
SM J320C31 memory and register contents. Reset the device with SHZ high to restore 
it to a known operating condition. 




XF1.XF0 


2 


i/o/z 


External flags. XF1 and XFO are used as general-purpose l/Os or to support 
interlocked processor instruction. 


S 


R 


SERIAL PORT 0 SIGNALS 


CLKRO 1 


l/O/Z 


Serial port 0 receive clock. CLKRO is the serial shift clock for the serial 
port 0 receiver. 


S 


R 


CLKXO 1 


l/O/Z 


Serial port 0 transmit clock. CLKXO is the serial shift clock for the serial 
port 0 transmitter. 


S 


R 


DRO 1 


l/O/Z 


Data receive. Serial port 0 receives serial data via DRO. 


S 


R 


DXO 1 


l/O/Z 


Data transmit output. Serial port 0 transmits serial data on DXO. 


S 


R 


FSRO 1 


l/O/Z 


Frame synchronization pulse for receive. The FSRO pulse initiates the receive data 
process over DRO. 


S 


R 


FSXO 1 


l/O/Z 


Frame synchronization pulse for transmit. The FSXO pulse initiates the transmit data 
process over DXO. 


S 


R 



1 1 = input, O = output, Z = high-impedance state 

* S = SHZ active, H = HOLD active, R = RESET active 
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SMJ320C31 Terminal Functions (Continued) 



TERMINAL 
NAME QTY 


TYPEt 


DESCRIPTION 


CONDITIONS 
WHEN 
SIGNAL IS Z TYPE* 


TIMER SIGNALS 


TCLKO 1 


l/O/Z 


Timer clock 0. As an input, TCLKO is used by timer 0 to count external pulses. As 
an output, TCLKO output pulses generated by timer 0. 


S 


TCLK1 1 


l/O/Z 


Timer clock 1 . As an input, TCLKO is used by timer 1 to count external pulses. As 
an output, TCLK1 outputs pulses generated by timer 1 . 


S 


SUPPLY AND OSCILLATOR SIGNALS 


H1 1 


o/z 


External H1 clock. H1 has a period equal to twice CLKIN. 




H3 1 


o/z 


External H3 clock. H3 has a period equal to twice CLKIN. 




V D D 20 


I 


5-V supply. All must be connected to a common supply plane.§ 




V S S 20 


I 


Ground. All grounds must be connected to a common ground plane. 




X1 1 


o/z 


Output from the internal crystal oscillator. If a crystal is not used, X1 should be left 
unconnected. 


S 


X2/CLKIN 1 


I 


Internal oscillator input from a crystal or a clock 




RESERVED^ 


EMU2-EMU0 3 


I 


Reserved. Use pullup resistors to 5 V. 




EMU3 1 


o/z 


Reserved 


S 



1 1 = input, O = output, Z = high-impedance state 

* S = SHZ active, H = HOLD active, R = RESET active 

§ Recommended decoupling capacitor value is 0.1 \iF. 

' Follow the connections specified for the reserved terminals. Use 18-kQ-22-kQ pullup resistors for best results. All 5-V supply terminals must 
be connected to a common supply plane, and all ground terminals must be connected to a common ground plane. 
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absolute maximum ratingst 

Supply voltage range, Vqc (see Note 8) -0.3 V to 7 V 

Input voltage range, Vj - 0.3 V to 7 V 

Output voltage range, Vq -0.3 V to 7 V 

Continuous power dissipation (see Note 9) 3.15 W 

Minimum free-air operating temperature, - 55°C 

Maximum operating case temperature, Tc 125°C 

Storage temperature range - 65°C to 1 50°C 



t Stresses beyond those listed under "absolute maximum ratings" may cause permanent damage to the device. These are stress ratings only, and 
functional operation of the device at these or any other conditions beyond those indicated under "recommended operating conditions" is not 
implied. Exposure to absolute-maximum-rated conditions for extended periods may affect device reliability. 
NOTES: 8. All voltage values are with respect to Vss- 

9. Actual operating power is less. This value was obtained under specially produced worst-case test conditions, which are notsustained 
during normal device operation. These conditions consist of continuous parallel writes of a checkerboard pattern to both primary and 
extension buses at the maximum rate possible. See normal (Ice) current specification in the electrical characteristics table and also 
read Calculation of TMS320C30 Power Dissipation Application Report. 



recommended operating conditions (see Note 10) 





MIN 


NOM* 


MAX 


UNIT 






•320C30-28 
'320C31-40 


4.75 


5 


5.25 




v D d 


Supply voltage 


'320C30-33 


4.5 


5 


5.5 


V 






'320C31-27 
'320C31-33 


4.5 


5 


5.5 




vss 


Supply voltage (CV$s. etc.) 




0 


V 


V|H 


High-level input voltage 




2.1 




V DD + 0.3§ 


V 


V T H 


High-level input voltage for CLKIN 




3 




V DD + 0.3§ 


V 


V|L 


Low-level input voltage 




-0.3§ 




0.8 


V 


'OH 


High-level output current 




-300 


jiA 


"OL 


Low-level output current 




2 


mA 


T A 


Operating free-air temperature 




-55 


°C 


TC 


Operating case temperature 




125 


°C 



* All nominal values are at Vdd = 5 V, Ta = 25°C. 

§ These values are derived from characterization and not tested. 

NOTE 10: All input and output voltage levels are TTL compatible. 
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electrical characteristics over recommended ranges of supply voltage (unless otherwise noted) 
(see Note 10) 



PARAMETER 


TEST CONDITIONSt 


MIN 


TYP* 


MAX 


UNIT 


VOH 


High-level output voltage 




Vqd = MIN, ioh = max 


2.4 


3 




V 


vol 


Low-level output voltage 


ForXA12-XA0 


V D D = MIN, Iol = max 


0.6§ 


V 


All others 


Vdd = MIN, Iql = max 




0.3 


0.6 


V 


iz 


High-impedance current 




v D d = max 


±20 


HA 


l| Input current 


V| = Vss to Vdd 


±10 


MA 


"IP 


Input current 




Inputs with internal pullups (s 


ee Note 11) 


-400 




20 


ma 


lie 


Input current (X2/CLKIN) 




V| = Vss t0 V CC 


±50 


ma 










'320C30-33 




200 


600 










T A = 25°C, V D D = max 
tc(CI) = MIN 


'320C31 -33 




150 


325 




•cc 


Supply current 




'320C30-28 




175 


500 


mA 








(see Note 12) 


'320C31-27 




125 


250 












'320C31-40 




250 


400 




Cj 


Input capacitance 






15 4 " 


PF 


C 0 


Output capacitance 








PF 


Cx 


X2/CLKIN capacitance 






25^ 


PF 



t For conditions shown as MIN /MAX, use the appropriate value specified in recommended operating conditions. 

* All typical values are at Vdd = 5 v . T A = 25 ° c - 

§ These values are derived from characterization but not tested. 

11 These values are derived by design but not tested. 

NOTES: 1 0. All input and output voltage levels are TTL compatible. 

11. Terminalswith internal pullup devices: INT0-INT3, MC/MP, RSV0-RSV10. Although RSV0-RSV10 have internal pullup devices, 
external pullups should be used on each terminal as identified in the Terminal Functions tables. 

12. Actual operating current is less than this maximum value. This value was obtained under specially produced worst-case test 
conditions, which are not sustained during normal device operation. These conditions consist of continuous parallel writes of a 
checkerboard pattern to both primary and expansion buses at the maximum rate possible. See Calculation ofTMS320C30 Power 
Dissipation Application Report. 
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PARAMETER MEASUREMENT INFORMATION 



Tester Pin 
Electronics 



VLOAD 




Output 
Under 
Test 



Where: Iql 



2 mA (all outputs) 
300 [lA (all outputs) 
2.15V 

80-pF typical load-circuit capacitance 




Figure 1. Test Load Circuit 



signal transition levels 

TTL-level outputs are driven to a minimum logic-high level of 2.4 V and to a maximum logic-low level of 0.6 V. 
Output transition times are specified as follows: 

• For a high-to-low transition on a TTL-compatible output signal, the level at which the output is said to be 
no longer high is 2 V, and the level at which the output is said to be low is 1 V. 

• For a low-to-high transition, the level at which the output is said to be no longer low is 1 V, and the level at 
which the output is said to be high is 2 V. 



Transition times for TTL-compatible inputs are specified as follows: 

• For a high-to-low transition on an input signal, the level at which the input is said to be no longer high is 
2.1 V, and the level at which the input is said to be low is 0.8 V. 

• For a low-to-high transition on an input signal, the level at which the input is said to be no longer low is 
0.8 V, and the level at which the input is said to be high is 2.1 V. 




Figure 2. TTL-Level Outputs 





Figure 3. TTL-Level Inputs 
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PARAMETER MEASUREMENT INFORMATION 



timing parameter symbology 

Timing parameter symbols used herein were created in accordance with JEDEC Standard 100-A. In order to 
shorten the symbols, some of the terminal names and other related terminology have been abbreviated as 
follows, unless otherwise noted: 

• INT includes IRT3-i NT0 

• (M)S in symbols and (M)STRB in description includes STRB and MSTRB 

• (X)A includes A23-A0 and XA1 2-XAO 

• (X)D includes D31 -DO and XD13-XD0 

• (X)RW in symbols and (X) R/W in description includes R/W and XR/W 

• (X)RDY includes RDY and XRDY 
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timing parameters for CLKIN, H1, and H3 (see Note 10) 



NO. 






•320C31-27 


'320C30-28 


'320C30-33 
•320C31-33 


•320C31 


.40 


UNIT 








MIN 


MAX 


MIN 


MAX 


MIN 


MAX 


MIN 


MAX 




1 


tf(CI) 


Fall time, CLKIN 


5t 


5t 


5t 


5t 


ns 


2 


tw(CIL) 


Pulse duration, CLKIN low, 
*c(CI) = iviiiM (see iMote ioj 


13 


12.25 


10.5 


9 


ns 


3 


tw(CIH) 


Pulse duration, CLKIN high, 
tc(ci) - MIN (see Note 13) 


13 


12.25 


10.5 


9 


ns 


4 


V(CI) 


Rise time, CLKIN 


5t 


5t 


5t 


5t 


ns 


5 


tc(CI) 


Cycle time, CLKIN 


37 


303 


35 


303 


30 


303 


25 


303 


ns 


o 


l f(H) 


Call tima UH /UO 

ran lime, m / no 


4 


3 


3 


3 


ns 


7 


tw(HL) 


Pulse duration, H1 /H3 low 
(see Note 14) 


P-6 


P-6 


P-6 


P-5 


ns 


8 


*w(HH) 


Pulse duration, H1 /H3 high 
(see Note 14) 


P-7 


P-7 


P-7 


P-6 


ns 


9 


tr(H) 


Rise time, H1 /H3 


4 


4 


4 


3 


ns 


9.1 


id(HL-HH) 


Deiay iime, HI / H3 iow to H3/H1 high 




5 


_ + 
u+ 


5 


u+ 


5 




4 


ns 


10 


^(H) 


Cycle time, H1/H3 


74 


606 


70 


606 


60 


606 


50 


606 


ns 



t These values are derived by design but not tested. 
* These values are derived from characterization but not tested. 
NOTES: 1 0. All input and output voltage levels are TTL compatible. 

1 3. Rise and fall times, assuming a 35 - 65% duty cycle, are incorporated within this specification (see Figure 4). 

14- P-tc(CI) 



X2/CLKIN 




5 V) 



Figure 4. X2/CLKIN Timing 



|« — 2—H 



H1 



H3 




■10- 



Figure 5. H1/H3 Timing 
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SMJ320C30-33 
4.5 V x Vpp a 5.5 V 




1- 



-55°C 25°C 125°C 

Temperature 

Figure 6. CLKIN to H1/H3 as a Function of Temperature 
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memory-read-cycle and memory-write-cycle timing ((M)STRB = 0) (see Figures 7 and 8) 



NO. 






•320C31-27 


'320C30-28 


•320C30-33 
'320C31-33 


'320C31-40 


UNIT 








MIN 


MAX 


MIN 


MAX 


MIN 


MAX 


MIN 


MAX 




11 


tdrH1L-(M)SLl 


Delay time, H1 low to (M)STRB low 


ot 


10 


ot 


10 


ot 


10 


ot 


6 


ns 


12 


td[H1L-(M)SH] 


Delay time, H1 low to (M)STRB 
high 


0* 


10 


ot 


10 


ot 


10 


ot 


6 


ns 


13.1 


td(HIH-RWL) 


Delay time, H1 high to R/W low 


0* 


10 


ot 


10 


ot 


10 


ot 


9 


ns 


13.2 


tdrH1H-(X)RWLl 


Delay time, H1 high to (X)R/Wlow 




ot 


17 


ot 


15 




ns 


14.1 


*d(H1L-A) 


Delay time, H1 low to A valid 


0* 


16 


ot 


16 


ot 


14 


ot 


10 


ns 


14.2 


tdrH1L-(X)Al 


Delay time, H1 low to (X)A valid 




ot 


13 


ot 


10 




ns 


15.1 


tsu(D)R 


Setup time, D valid before H1 low 
(read) 


18 


19 


16 


14 


ns 


15.2 


••su^AU^n 


Setup time, (X)D before H1 low 
(read) 


1 


20 


18 




ns 


16 


thr(X)DlR 


Hold time, (X)D after H1 low (read) 


ot 


ot 


ot 


ot 


ns 


17.1 


tenmnv\ 


Setup time, RDY before H1 high 


10 


10 


8 


8 


ns 


17.2 


tsu(XRDY) 


Setuptime, (X) RDY before H1 high 




10 


9 




ns 


18 


thNX)RDYl 


Hold time, (X)RDY after H1 high 


0 


0 


0 


0 


ns 


19 


td[H1H-(X)RWH] 


Delay time, H1 high to (X)R/W 
high (write) 


12 


12 


10 


9 


ns 


20 


M00D1W 


Valid time, (X)D after H1 low (write) 


20 


20 


20 


17 


ns 


21 


*h[(X)D]W 


Hold time, (X)D after H1 high 
(write) 


ot 


ot 


ot 


ot 


ns 


22.1 


td(H1H-A) 


Delay time, H1 high to A valid on 
back-to-back write cycles (write) 


22 


22 


18 


15 


ns 


22.2 


td[H1H-(X)A] 


Delay time, H1 high to (X) A valid on 
back-to-back write cycles (write) 




32 


25 




ns 


26 


td[A-{X)RDYl 


Delay time, (X)RDY from A valid 


8* 


8* 


8* 


7* 


ns 



t These values are derived by design but not tested. 

* These values are derived from characterization but not tested. 
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H3 
H1 



(M)STRB 
(X)R/W 

(X)A 




14.1/14.2 



Z 



— M 

I | 13.1/13.2 -* *=- 

-\Jr- i — ^ 

_y\ _A 

1 5.1/15.2 -»l («— 

o 



41. 



MD 17.1/17.2 -*J Up - 



(X)RDY 



Figure 7. Memory-Read-Cycle Timing ((M)STRB = 0) 




(X)RDY 



Figure 8. Memory-Write-Cycle Timing ((M)STRB = 0) 
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memory-read-cycle timing (IOSTRB = 0, SMJ320C30 only) 



NO. 






'320C30-28 


'320C30-33 


UNIT 






MIN 


MAX 


MIN 


MAX 


27 


tdfl-HH-IOSLl 


Delay time, H1 high to IOSTRB low 


ot 


11 


ot 


10 


ns 


28 


tdO-MH-IOSH) 


Delay time, H1 high to IOSTRB high 


ot 


10 


ot 


10 


ns 


29 


tdrH1L-(X)RWHl 


Delay time, H1 low to (X)R/W high 


ot 


11 


ot 


10 


ns 


30 


tdrH1L-(X)Al 


Delay time, H1 low to (X)A valid 


ot 


12 


ot 


10 


ns 


31 


WOODIR 


Setup time, (X)D before H1 high 


15 


15 


ns 


32 


thrOODIR 


Hold time, (X)D after H1 high 


0* 


0* 


ns 


33 


tsu[(X)RDYl 


Setup time, (X)RDY before H1 high 


10 


9 


ns 


34 


th[(X)RDY] 


Hold time, (X)RDY after H1 high 


0 


0 


ns 



t These values are derived 
t These values are derived 



by design but not tested. 

from characterization but not tested. 



^^^^ 



H1 



IOSTRB 



(X)R/W 



(X)A 



(X)D 



\ 

I 
I 


27 — « k- 28 j#- 


r 


I 
I 

H 


W— 29 1 






? ! — 


\ 






X 


31 -w W- 

1— >l k— 32 



(X)RDY 



I -*l 14-34 



Figure 9. SMJ320C30 Memory-Read-Cycle Timing (IOSTRB = 0) 
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memory-write-cycle timing (IOSTRB = 0, SMJ320C30 only) 



NO. 




'320C30-28 


'320C30-33 


UNIT 


MIN MAX 


MIN MAX 


35 


*d (H 1 L-XRWL) Delay time, H 1 low to XR / W low 


Ot 15 


Ot 15 


ns 


36 


M(X)D1W Valid time » (*) D after H1 n '9 n 


30 


30 


ns 


37 


th[(X)D]W Hold time » (X) D after H1 ,ow 


0 


0 


ns 



t These values are derived by design but not tested. 

S 



H3 



H1 



IOSTRB 



(X)R/w 



(X)A 



(X)D 



(X)RDY 



27- 



28-N \4- 



X 



f 



35 



f* — H- 



30 



X 



29 



Z 



xz 



36 



37 



33 



> 



l-frl K— 34 

I I 



Figure 10. SMJ320C30 Memory-Write-Cycle Timing (IOSTRB = 0) 
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timing for XFO and XF1 when executing LDFI or LDII 



NO. 




•320C31-27 


'320C30-28 


'320C30-33 
'320C31-33 


'320C31-40 


UNIT 


MIN MAX 


MIN MAX 


MIN MAX 


MIN MAX 


38 


td(H3H-XF0L) Delay time, H3 high to XFO low 


19 


15 


15 


13 


ns 


39 


tsu(XF1) Setup time, XF1 valid before H1 low 


13 


15 


12 


9 


ns 


40 


th(XF1) H 0 '^ time, XF1 after H1 low 


0 


0 


0 


0 


ns 




Figure 11. Timing for XFO and XF1 When Executing LDFI or LDII 



E-24 



Texas 
Instruments 

POST OFFICE BOX 1443 * HOUSTON, TEXAS 77251-1443 



SMJ320C3X 
DIGITAL SIGNAL PROCESSOR 

SGUS014A- FEBRUARY 1991 - REVISED SEPTEMBER 1994 



timing for XFO when executing a STFI or STII 



NO. 




•320C31 -27 


'320C30-28 


'320C30-33 
'320C31 -33 


'320C31 -40 


UNIT 


MIN MAX 


MIN MAX 


MIN MAX 


MIN MAX 


41 


^(HSH-XFOH) Dela y t' m ©. H 3 high to XFO high 


19 


20 


18 


13 


ns 



Fetch 

| STFI or STII | Decode | 



Read Execute 




(M)STRB 
(X)R/W 
(X)A 
(X)D 



■\ r 



< 



PQRDY 



41 



XFO 



T 



Figure 1. Timing for XFO When Executing a STFI or STII 
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timing for XFO and XF1 when executing SIGI 



NO. 




•320C31 -27 


'320C30-28 


'320C30-33 
'320C31 -33 


'320C31-40 


UNIT 


MIN MAX 


MIN MAX 


MIN MAX 


MIN MAX 


41.1 


^(HSH-XFOL) Delay time, H3 high to XFO low 


19 


15 


15 


13 


ns 


42 


td(H3H-XF0H) Dela y time > H3 ni 9h to XFO high 


19 


20 


18 


13 


ns 


43 


*su(XF1) Setup time, XF1 valid before H1 low 


13 


12 


12 


9 


ns 


44 


th(XF1) Hold time, XF1 after H1 low 


0 


0 


0 


0 


ns 



H3 



H1 



XFO 



XF1 




Figure 2. Timing for XFO and XF1 When Executing SIGI 



timing for loading XF register when configured as an output 



NO. 




'320C31 -27 


•320C30-28 


•320C30-33 
'320C31 -33 


'320C31-40 


UNIT 


MIN MAX 


MIN MAX 


MIN MAX 


MIN MAX 


45 


MH3H-XF) Valid time - H3 n '9 n to XF valid 


19 


20 


15 


13 


ns 



OUTXF Bit 



XFx 




Figure 3. Timing for Loading XF Register When Configured as an Output 
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change of XF from output to input mode 



NO. 




'320C31 -27 


'320C30-28 


'320C30-33 
'320C31 -33 


'320C31 -40 


UNIT 


MIN MAX 


MIN MAX 


MIN MAX 


MIN MAX 


46 


^(HSH-XFOI) Hold time « XF after H3 hi 9 h 


20t 


20t 


15t 


13t 


ns 


47 


tsu(XF) Setup time, XF before H1 low 


12 


12 


12 


9 


ns 


48 


th(XF) Hold time . XF after H1 ,ow 


0 


0 


0 


0 


ns 



t These values are derived from characterization but not tested. 



l/OXFx Bitt 



XFx 



I Execute 
I LoadoflOF 



Buffers Go 
| From Output 
to Input 



Value on 
Synchronizer I Terminal I 
Delay j Seen In IOF j 




INXFx Bltt 

CamnlaH \ r 

Data 
Seen 

t l/OXFx represents either bit 1 or bit 5 of the IOF register, and INXFx represents either bit 3 or bit 7 of the IOF register 
depending on whether XFO or XF1 , respectively, is being affected. 

Figure 4. Change of XFx From Output to Input Mode 
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change of XFx from input to output mode 



NO. 




'320C31-27 


'320C30-28 


'320C30-33 
'320C31 -33 


'320C31-40 


UNIT 


MIN MAX 


MIN MAX 


MIN MAX 


MIN MAX 


49 


Delay time, H3 high to XF switching 
d ( H3H " XFI °) from input to output 


20 


20 


20 


17 


ns 



Execution of 
Load of IOF 



H3 



H1 



l/OXFx BItt 



XFxt 



i 



49 



t I/OXFx represents either bit 1 or bit 5 of the IOF register, and INXFx represents either bit 3 or bit 7 of the IOF register 
depending on whether XFO or XF1 , respectively, is being affected. 

Figure 5. Change of XF From Input to Output Mode 
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reset timing 

RESET is an asynchronous input that can be asserted at any time during a clock cycle. If the specified timings 
are met, the exact sequence shown in Figure 1 7 occurs; otherwise, an additional delay of one clock cycle may 
occur. R/ W and XR/W are in the high-impedance state during reset and can be provided with a resistive pullup, 
nominally 18 kQ to 22 kQ, to prevent spurious writes from occurring. The asynch ronous reset signals include 
XFO/1 , CLKXO/1 , DXO/1 , FSXO/1 , CLKRO/1 , DRO/1 , FSRO/1 , and TCLKO/1 . HOLD is an asynchronous input and 
can be asserted during reset. 

Resetting the device initializes the primary- and expansion-bus control registers to seven software wait states 
and, therefore, results in slow external accesses until these register are initialized. 



reset timing [P = t c(C |)] 



NO. 






'320C30-28 


'320C31 


-27 


'320C30-33 
'320C31-33 


'320C31 


-40 


UNIT 








MIN 


MAX 


MIN 


MAX 


MIN 


MAX 


MIN 


MAX 




50 


tsu(RESET) 


Setup time, RESET before 

CA KIN low 

vLlM 1 ^1 IUW 


10 


Pt 


10 


Pt 


10 


Pt 


10 


P* 


ns 


51 


td(CLKINH-HIH) 


Delay time, CLKIN high to 
H1 high§ 


3 


18 


2 


14 


2 


14 


2 


14 


ns 


52 


l d(CLKINH-H1 L) 


Delay time, CLKIN high to 
H1 low§ 


3 


18 


2 


14 


2 


14 


2 


14 


ns 


53 


tsu(RESETH-HIL) 


Setup time, RESET high 
before H1 low after 10 H1 
clock cycles 


15 


13 


10 


9 


ns 


54 


td(CLKINH-H3L) 


Delay time, CLKIN high to 
H3 low§ 


3 


18 


2 


14 


2 


14 


2 


14 


ns 


55 


td(CLKINH-H3H) 


Delay time, CLKIN high to 
H3 high§ 


3 


18 


2 


14 


2 


14 


2 


14 


ns 


56 


tdis(HIH-XD) 


H1 high to (X)D high- 
impedance state 


20t 


19t 


18t 


15t 


ns 


57 


tdis(H3H-XA) 


H3 high to (X)A high- 
impedance state 


12t 


12t 


10t 


ot 


ns 


58 


td(H3H-CONTROLH) 


Delay time, H3 high to 
control signals high 


10t 


10t 


iot 


9t 


ns 


59 


td(HIH-IACKH) 


Delay time, H1 high to 
lACKhigh 


12t 


12t 


10t 


9t 


ns 


60 


tdis(RESETL-ASYNCH) 


RESET low to 
asynchronously reset 
signals to high-impedance 
state 


25t 


25t 


25t 


2lt 


ns 



t These values are derived from characterization but not tested, 
t These values are derived by design but not tested. 

§ See NO TAG for temperature dependence for the 33-MHz SMJ320C30 and SMJ320C31 . 
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CLKIN 



-*l k-50 ! I 

jr 




(X)D 
(see Note A) 

(X)A 
(see Notes A 
andB) 

Control Signals 
(see Note C) 



Asynchronous H 
Reset Signals } w 

(see Note D) 

NOTES: A. Reset vector is fetched three times with 7 software wait states each. 

B. (X)A includes A23-A0, XA12-XA0, an d (X) R/W. 

C. Control signals include STRB, MSTRB, and IOSTRB. 

D. Asynchronously reset signals include XF1 , XFO, CLKXO, DXO, FSXO, CLKRO, DRO, FSRO, CLKX1 , DX1 , FSX1 , CLKR1 , DR1 , FSR1 , 
TCLKO, and TCLK1. 



Figure 6. Reset Timing 
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INT3-INT0 response timing [Q = t c (H)] 



NO. 




'320C30-28 


'320C31-27 


CO CO 
CO CO 

O T- 

CO CO 

o o 

o o 

CM CM 
CO CO 


'320C31-40 


UNIT 


MIN MAX 


MIN MAX 


MIN MAX 


MIN MAX 


61 


Setup time, INT3-INT0 
tsu ( |NT ) before H1 low 


15 


15 


15 


13 


ns 


62 


t Pulse duration, IT?f3-INT0, 

(sie N Notes 1 and 2) t0 aSsure on| y one 
N ' seen 


Q <2Qt 


Q <2Qt 


Q <2Qt 


Q <2Qt 


ns 



t These values derived from characterization but not tested. 
NOTES: 1 . Interrupt pulse duration must be at least 1 Q wide to assure it is seen. It must be less than 2Q wide to assure it is responded to only 
once. 

2. INT3-INT0 are asynchronous inputs and can be asserted at any point during a clock cycle. The SMJ320C3x interrupts are level 
sensitive, not edge sensitive. Interrupts are detected on the falling edge of H1 . For the processor to recognize only one interrupt on 
a given input, an interrupt pulse must be set up and held to a minimum of one H1 falling edge and no more than two H1 falling edges. 
The SMJ320C3x can accept an interrupt from the same source every two H1 clock cycles. If the specified timings are met, the exact 
sequence shown occurs; otherwise, an additional delay of one clock cycle may occur. 



INT3-INT0 
Terminals 



INT3-INT0 
Flag 



Addr 



Data 



Reset or 
Interrupt 
Vector 
Read 



62 if 



Vector Address 



Fetch First 
Instruction of 
Service Routine | 




First 
Instruction 
Address 



<Z3> 
O 



Figure 7. INT3-INT0 Response Timing 
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interrupt-acknowledge (IACK) timing 



NO. 




'320C30-28 


•320C31 -27 


'320C30-33 
•320C31-33 


'320C31-40 


UNIT 


MIN MAX 


MIN MAX 


MIN MAX 


MIN MAX 


63 


td(H1 H-IACKL) D ©'ay time, H1 high to IACK low 


12 


12 


10 


9 


ns 


64 


kKHI H-IACKH) Dela y time » H1 hi 9 h t0 ,ACK high 


12 


12 


10 


9 


ns 



Fetch IACK . . IACK 

Instruction | | Data Read 




Figure 8. Interrupt-Acknowledge IACK Timing 
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serial-port timing 



NU. 








'320C30-28 


'320C31 -27 


UNIT 








MIN 


MAX 


MIN 


MAX 


65 


td(H1-SCK) 


Delay time, H1 high to internal CLKX/R 




17 


17 


ns 


66 


tySCK) 


uycie time, ULrsA/ h 


CLKX/R ext 


tc(H)*2.6t 


tc (H )x2.6t 


ns 


CLKX/Rint 


tc(H)* 2 


tc(H)x232* 


tc(H)x2 


tc(H)x232* 


67 


tw(SCK) 


ruise duration, ulkx/r high/ low 


CLKX/R ext 


tc(H)+15t 


tc(H)+12t 


ns 


CLKX/Rint 


[tc(SCK)/2]-15 


[tc(SCK)/2]+5 


[tc(SCK)/2]-15 


[tc(SCK)/2]+5 


68 


*r(SCK) 


Rise time, CLKX/R 




8t 


8t 


ns 


69 


tf(SCK) 


Fall time, CLKX/R 




8t 


8t 


ns 


70 


tyDX) 


ueiay time, olka to ua valid 


CLKX ext 


35 


35 


ns 


CLKX int 


20 


20 


71 


tsu(DR) 


Setup time, DR before CLKR low 


CLKR ext 


10 


10 


ns 


CLKR int 


25 


25 


72 


*h(DR) 


Hold time, DR from CLKR low 


CLKR ext 


10 


10 


ns 


CLKR int 


ot 


ot 


73 


td(FSX) 


Delay time, CLKXto internal FSX high /low 


CLKX ext 


32 


32 


ns 


CLKX int 


17 


17 


74 


tsu(FSR) 


Setup time, FSR before CLKR low 


CLKR ext 


10 


10 


ns 


CLKR int 


10 


10 


75 


*h(FS) 


Hold time, FSX/R input from CLKX/R low 


CLKX/R ext 


10 


10 




CLKX/R int 


0 


0 


76 


tsu(FSX) 


Setup time, external FSX before CLKX 


CLKX ext 


-[tc(H)"8] 


[tc(SCK)/2]-10* 


~[tc(H)-8] 


[tc(SCK)/2]-10* 


ns 


CLKX int 


-[tc(H)-21] 


*c(SCK)/2* 


-[tc(H)"21] 


tc(SCK)/2* 


77 


*d(CH-DX)V 


Delay time, CLKX to first DX bit, FSX precedes 


CLKX ext 


36 


36 


ns 


CLKX high 


CLKX int 


21 


21 


78 


td(FSX-DX)V 


Delay time, FSX to first DX bit, CLKX precedes FSX 


36 


36 


ns 


79 


tdDXZ 


Delay time, CLKX high to DX high impedance following last data bit 


20t 


20t 


ns 



t These values are derived from characterization but not tested. 
* These values are derived by design but not tested. 



serial-port timing (continued) 



NO. 








'320C30-33 
'320C31 -33 


'320C31 -40 


UNIT 










MIN 


MAX 


MIN 


MAX 




65 


td(H1-SCK) 


Delay time, H1 high to internal CLKX/R 




15 


13 


ns 


66 


tc(SCK) 


Cycle time, CLKX/R 


CLKX/R ext 


tc(H)*2.6t 


tc(H)x 2.6t 


ns 


CLKX/R int 


tc(H)*2 


tc(H)*232* 


tc(H)*2 


tc( H )x232* 


67 


tw(SCK) 


Pulse duration, CLKX/R high/low 


CLKX/R ext 


tc(H)+12t 


tc(H)+12+ 


ns 


CLKX/R int 


[tc(SCK)/2]-15 


[tc(SCK)/2]+5 


[tc(SCK)/2]-15 


[*c(SCK)/2]+5 


68 


tr(SCK) 


Rise time, CLKX/R 




8t 


7t 


ns 


69 


tf(sCK) 


Fall time, CLKX/R 




8t 


7t 


ns 


70 


td(DX) 


Delay time, CLKX to DX valid 


CLKX ext 


35 


30 


ns 


CLKX int 


20 


17 


71 


tsu(DR) 


Setup time, DR before CLKR low 


CLKR ext 


10 


9 


ns 


CLKR int 


25 


21 


72 


th(DR) 


Hold time, DR from CLKR low 


CLKR ext 


10 


9 


ns 


CLKR int 


ot 


0 


73 


tyFSX) 


Delay time, CLKX to internal FSX high /low 


CLKX ext 


32 


27 


ns 


CLKX int 


17 


15 


74 


tsu(FSR) 


Setup time, FSR before CLKR low 


CLKR ext 


10 


9 


ns 


CLKR int 


10 


9 


75 


th(FS) 


Hold time, FSX/R input from CLKX/R low 


CLKX/R ext 


10 


9 


ns 


CLKX/R int 


0 


0 


76 


tsu(FSX) 


Setup time, external FSX before CLKX 


CLKX ext 


-[tc(H)"8] 


[tc(SCK)/2]-10* 


-[tc(H)-8j 


[tc(SCK)/2]-10* 


ns 


CLKX int 


-[tc(H)"21] 


tc(SCK)/2* 


-[tc(H)"21] 


tc(SCK)/2* 


77 


td(CH-DX)V 


Delay time, CLKX to first DX bit, FSX 


CLKX ext 


36 


30 


ns 


precedes CLKX high 


CLKX int 


21 


18 


78 


td(FSX-DX)V 


Delay time, FSX to first DX bit, CLKX precedes FSX 


36 


30 


ns 


79 


tdDXZ 


Delay time, CLKX high to DX high impedance following last data bit 


20t 


17t 


ns 



o CO 

eE 

>8 

> 
■o 

30 
O 
O 
IT! 
CO 
C/> 
O 



> 

i 



3D 



CD 

m 
u 



t These values are derived from characterization but not tested. 
* These values are derived by design but not tested. 
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CLKX/R 



DX« 



i i 

I 70-^ 



J SpZ!L3l NL 



a r 



72 



79 



Bit 



n - 1 ~X Bit n - 2 



BitO 



DR 




FSR 



FSX (int) 



FSX 



(ext)_yf 



I 



Bit n - 1 



— N 74 



Bltn-2 



73 



jT—tt: 



75 



i r 



76 



-75 



NOTES: A. Timing diagrams show operations with the serial-port global-control register bits CLKXP = CLKRP = FSXP = FSRP = 0. 

B. These timings are valid for all serial port modes, including handshake, except where otherwise indicated. For a functional description 
of serial port operation, refer to the TMS320C3x User's Guide. 

C. Timing diagrams depend upon the length of the serial-port word, where n = 8, 16, 24, or 32 bits, respectively. 

Figure 1. Serial-Port Timing, Fixed-Data-Rate Mode 



CLKX/R 
FSX (Int) 

FSX (ext) 



73 



> 



78 



"X. 



y 



1 1 



70- 



-77 



DX 



FSR 



Bit n - 1 



± 



79 



Bitn-2 



BltO 



jr 



75 



DR 



68— »l ,4- 



Blt n - 1 



71- 



Bitn-2 



Bitn-3 



72 



NOTES: A. Timing diagrams show operations with the serial-port global-control register bits CLKXP = CLKRP = FSXP = FSRP = 0. 

B. These timings are valid for all serial-port modes, including handshake, except where otherwise indicated. 

C. Timings not expressly specified for variable-data-rate mode are the same as those for fixed-data-rate mode. 

D. Timing diagrams depend upon the length of the serial-port word, where n = 8, 16, 24, or 32 bits, respectively. 

Figure 2. Serial-Port Timing, Variable-Data-Rate Mode 
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HOLD/HOLDA timing (see Note 1) 



NO. 






'320C30-28 


'320C31 


-27 


'320C30-33 
'320C31-33 


'320C31 


.40 


UNIT 








MIN 


MAX 


MIN 


MAX 


MIN 


MAX 


MIN 


MAX 




80 


tsu(HOLD) 


Setup time, HOLD 
before H1 low 


15 


15 


15 


13 


ns 


81 


W(HOLDA) 


Valid time, HOLDA after 
H1 low 


ot 


10 


ot 


10 


ot 


10 


0* 


9 


ns 


82 


tw(HOLD) 


Pulse duration, HOLD 
low 


^(H) 


2tc(H) 




ac(H) 


ns 


83 


tw(HOLDA) 


Pulse duration, HOLDA 
low 


tc(H)-5* 


tc(H)~5* 


tc(H)~5* 


tc(H)-5* 


ns 


84 


td(H1L-SH)H 


Delay time, H1 low to 
STRB high for a HOLD 


ot 


10* 


ot 


10* 


ot 


10* 


ot 


9* 


ns 


85 


*dis(H1L-S) 


Disable time, H1 low to 
STRB high impedance 


ot 


10* 


ot 


10* 


ot 


10* 


ot 


9* 


ns 


OO 


ten(HIL-S) 


Enable time, H1 low to 
STRB active 


ot 


10* 


ot 


10* 


ot 


10* 


ot 


9* 


ns 


87 


tdis(HIL-RW) 


Disable time, H1 low to 
R/W high impedance 


ot 


10* 


ot 


10* 


ot 


10* 


ot 


9* 


ns 


88 


l ©n^m L-rivvj 


Enable time, H1 low to 
R/W active 


ot 


10* 


ot 


10* 


ot 


10* 


ot 


9* 


ns 


89 


tdis(HIL-A) 


Disable time, H1 low to 
address high 
impedance 


ot 


15* 


ot 


13* 


ot 


10* 


ot 


9* 


ns 


90 


ten(HIL-A) 


Enable time, H1 low to 
address valid 


ot 


15* 


ot 


15* 


ot 


15* 


ot 


13* 


ns 


91 


tdis(HIH-D) 


Disable time, H1 high to 
data high impedance 


ot 


15* 


ot 


15* 


ot 


15* 


ot 


12* 


ns 



t These values are derived by design but not tested. 
* These values are derived from characterization but not tested. 



NOTE 1 : HOLD is an asynchronous input and can be asserted at any point during a clock cycle. If the specified timings are met, the exact 
sequence shown in Figure 3 occurs; otherwise, an additional delay of one clock cycle can occur. The NOHOLD bit of the 
primary-bus-control register (refer to the TMS320C3x User's Guide) overrides the HOLD signal. When this bit is set, the device comes 
out of hold and prevents future hold cycles from occurring. 



E-36 
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* I f 91 



Write Data 



NOTE A: HOLDA goes low in response to HOLD going low and continues to remain low through one H1 cycle after HOLD returns to high. 



Figure 3. HOLD/HOLDA Timing 
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peripheral-terminal general-purpose I/O timing (see Note 2) 



NO. 




'320C30-28 


'320C31-27 


•320C30-33 


'320C31-40 


UNIT 


MIN MAX 


MIN MAX 


MIN MAX 


MIN MAX 


92 


. Setup time, general-purpose input 
tsu(GPIOHIL) before H1 low 


15 


15 


12 


10 


ns 


93 


Hold time, general-purpose input 
th(GPIOH1L) after hi low 


0 


0 


0 


0 


ns 


94 


Delay time, general-purpose output 
td(GPIOH1H) after H1 high 


15 


15 


15 


13 


ns 



NOTE 2: Peripheral terminals include CLKX0/1 , CLKR0/1 , DX0/1 , DR0/1 , FSX0/1 , FSR0/1 , and TCLK0/1 . The modes of these terminals are 
defined by the contents of internal control registers associated with each peripheral. 




Figure 4. Peripheral-Terminal General-Purpose I/O Timing 
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change of peripheral terminal from general-purpose output to input mode 



NO. 




'320C31-27 


'320C30-33 
'320C31-33 


•320C31-40 


UNIT 


MIN MAX 


MIN MAX 


MIN MAX 


95 


*h(H3H) Hold time after H1 high 


15t 


15t 


13t 


ns 


96 


tsu(GPIOH1 L) Setup time, peripheral terminal before H1 low 


13 


12 


9 


ns 


97 


th(GPIOH1 L) Ho,d time ' peripheral terminal after H1 low 


0 


0 


0 


ns 



t These values are derived by design but not tested. 
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Figure 5. Change of Peripheral Terminal From General-Purpose Output to Input Mode 
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change of peripheral terminal from general-purpose input to output mode 



NO. 




'320C31 -27 


'320C30-33 
'320C31-33 


'320C31-40 


UNIT 


MIN MAX 


MIN MAX 


MIN MAX 


98 


Delay time, H1 high to peripheral terminal switching 
td(GPIOH1 H ) from input to output 


15 


15 


13 


ns 



H3 



Execution of Store of 
Peripheral Control 
Register 



HI 



1/0 
Control 
Bit 



Peripheral 
Terminal 



Figure 6. Change of Peripheral Terminal From General-Purpose Input to Output Mode 
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timing parameters for timer terminal 



NO. 




'320C30-28 


'320C30-33 
'320C31-33 


UNIT 


MIN MAX 


MIN MAX 


99 


Setup time, 
tsu(TCLK-H1L) TCLK ext 

before H1 low' 


TCLK ext 


15 


12 


ns 


100 


Hold time, 
th(TCLK-H1 L) TCLK ext after 
H1 lowt 


TCLK ext 


0 


0 


ns 


101 


Delay time, H1 
td(TCLK-H1 H) h'gh to TCLK 
int valid 


TCLK int 


15 


12 


ns 


102 


t .xx Cvc,e time > 
tc(TCLK) TCLKT 


TCLK ext 


tc(H)*2.6* 


tc(H) x 2.6* 


ns 


TCLK int 


*c(H) x 2 tc( H ) x 232* 


*C(H) x 2 tc( H ) x 232t 


ns 


103 


Pulse duration, 
tw(TCLK) TCLK 

high /lowt 


TCLK ext 


tc(H) + 10* 


tc(H) + 12* 


ns 


TCLK int 


[tc(TCLK)/2]- 5 [tc(TCLK)/2]+5 


[tc(TCLK)/2]-15 [tcfTCLK)/2]+5 


ns 




NO. 




'320C31-27 


'320C31-40 


UNIT 


MIN MAX 


MIN MAX 


99 


Setup time, 
tsu(TCLK-H1L) TCLK ext 

before H1 lowt 


TCLK ext 


15 


10 


ns 


100 


Hold time, 
th(TCLK-H1 L) TCLK ext after 
H1 lowt 


TCLK ext 


0 


0 


ns 


101 


Delay time, H1 
td(TCLK-H1H) high to TCLK 
int valid 


TCLK int 


13 


9 


ns 


102 


t ™, c y c|e time - 


TCLK ext 


tc(H)*2.6* 


tc( H ) x 2.6* 


ns 


tcCTCLK) TCLK t 


TCLK int 


tc(H)x2 t c (H)x232* 


tc(H) x 2 t c ( H ) x 232* 


ns 


103 


Pulse duration, 


TCLK ext 


tc(H) + 12* 


tc(H) + 12* 


ns 


tw(TCLK) TCLK 

high /lowt 


TCLK int 


[tcfTCLK) /2]-1 5 [tc(TCLK) /2J+5 


[tc(TCLK)/2]-5 [tcfTCLK) /2]+5 


ns 



t Timing parameters 99 and 1 00 are applicable for a synchronous input clock. Timing parameters 1 02 and 1 03 are applicable for an asynchronous 

input clock. 
* Assured by design but not tested 




NOTE A. Period and polarity of valid logic level are specified by contents of internal control registers. 

Figure 7. Timer-Terminal Timing 
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timing parameters for SHZ [P = t C (ci)l 



NO. 




'320C30-28 


'320C30-33 
'320C31 -33 


'320C31 -27 


'320C31-40 


UNIT 


MIN MAXt 


MIN MAXt 


MIN MAXt 


MIN MAXt 


104 


Disable time, SHZ low to all 0, I/O 
tdis(SHZ) njgn jmpedancet 


0 3P + 15 


0 3P + 15 


0 3P + 15 


0 3P + 15 


ns 


105 


Enable time, SHZ high to all 0,1/0 
ten(SHZ) activeT 


0 2P 


0 2P 


0 2P 


0 2P 


ns 



t These values are derived from characterization but not tested. 




* Enabling SHZ destroys SMJ320C3x register and memory contents. Assert SHZ and reset the SMJ320C3x to restore it to a known condition. 

Figure 8. Timing for SHZ 
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SMJ320C30 part order information 



DEVICE 


TECHNOLOGY 


POWER 
SUPPLY 


OPERATING 
FREQUENCY 


PACKAGE TYPE 


PROCESSING 
LEVEL 


SMJ320C30GBM28 


0.8-^im CMOS 


5 V ± 5% 


28 MHz 


Ceramic 181 -pin PGA 


Class B 


SM320C30GBM28 


0.8-nm CMOS 


5V±5% 


28 MHz 


Ceramic 1 81 -pin PGA 


Std 


5Qfi9-QnR9fifi1 MYA 

OC7U^-9UO^OU I IVIArN 


fi R-nm riuin^ 
u.o-|j,in v*»ivivjo 


5 V ± 5% 


Oft MU 7 


f^ornmif* 1ft1-nin PfiA 


DF^C fiiuiri 

UUOv OIVI Ls 


SMJ320C30GBM33 


0.8-^m CMOS 


5V±10% 


33 MHz 


Ceramic 181 -pin PGA 


Class B 


SM320C30GBM33 


0.8-nm CMOS 


5V±10% 


33 MHz 


Ceramic 181 -pin PGA 


Std 


5962-9052603MXA 


0.8-nm CMOS 


5V±10% 


33 MHz 


Ceramic 181 -pin PGA 


DESC SMD 


SMJ320C30HFGM28 


0.8-nm CMOS 


5V±5% 


28 MHz 


Ceramic 1 96-pin quad flatpack with 
nonconductive tie bar 


Class B 


SM320C30HFGM28 


0.8-^m CMOS 


5 V ± 5% 


28 MHz 


Ceramic 196-pin quad flatpack with 
nonconductive tie bar 


Std 


5962-9052601 MUA 


0.8-nm CMOS 


5V±5% 


28 MHz 


Ceramic 1 96-pin quad flatpack with 
nonconductive tie bar 


DESC SMD 


SMJ320C30HFGM33 


0.8-\xm CMOS 


5V±10% 


33 MHz 


Ceramic 196-pin quad flatpack with 
nonconductive tie bar 


Class B 


SM320C30HFGM33 


0.8-nm CMOS 


5V±10% 


33 MHz 


Ceramic 1 96-pin quad flatpack with 
nonconductive tie bar 


Std 


5962-9052603MUA 


0.8-^im CMOS 


5V±10% 


33 MHz 


Ceramic 196-pin quad flatpack with 
nonconductive tie bar 


DESC SMD 



SMJ 



320 



30 GB 



28 



PREFIX 

SMJ = 883 Class B 

SM = Standard Processing 



DEVICE FAMILY 

320 = SMJ320 Family 



TECHNOLOGY 

C = CMOS 



DEVICE 



SPEED RANGE 

28 = 28 MHz 
33 = 33 MHz 



TEMPERATURE RANGE 

M = -55°C to125°C 
L = 0°C to 70°C 



PACKAGE TYPE 

GB * Pin Grid Array (PGA) 
HFG = 1 96-Pin Quad Flatpack with a 
nonconductive tie bar 



Figure 9. SMJ320C30 Device Nomenclature 
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SMJ320C31 part order information 



DEVICE 


TECHNOLOGY 


POWER 
SUPPLY 


OPERATING 
FREQUENCY 


PACKAGE TYPE 


PROCESSING 
LEVEL 


SMJ320C31 GFAM27 


0.8-nm CMOS 


5V±10% 


27 MHz 


Ceramic 141 -pin staggered PGA 


Class B 


SM320C31GFAM27 


0.8-nm CMOS 


5V±10% 


27 MHz 


Ceramic 1 41 -pin staggered PGA 


Std 


SMJ320C31 GFAM33 


0.8-^im CMOS 


5 V ± 1 0% 


33 MHz 


Ceramic 1 41 -pin staggered PGA 


Class B 


SM320C31GFAM33 


0.8-nm CMOS 


5V±10% 


33 MHz 


Ceramic 141 -pin staggered PGA 


Std 


SMJ320C31 GFAM40 


0.8-jxm CMOS 


5 V±5% 


40 MHz 


Ceramic 141 -pin staggered PGA 


Class B 


SM320C31GFAM40 


0.8-nm CMOS 


5V±5% 


40 MHz 


Ceramic 1 41 -pin staggered PGA 


Std 


SMJ320C31 HFGM27 


0.8-jim CMOS 


5V±10% 


27 MHz 


Ceramic 1 32-pin quad flatpack with a 
nonconductive tie bar 


Class B 


SM320C31HFGM27 


0.8-jim CMOS 


5V±10% 


27 MHz 


Ceramic 132-pin quad flatpack with a 
nonconductive tie bar 


Std 


SMJ320C31 HFGM33 


0.8-nm CMOS 


5V±10% 


33 MHz 


Ceramic 1 32-pin quad flatpack with a 
nonconductive tie bar 


Class B 


SM320C31HFGM33 


0.8-fim CMOS 


5V±10% 


33 MHz 


Ceramic 1 32-pin quad flatpack with a 
nonconductive tie bar 


Std 


SMJ320C31 HFGM40 


0.8-fxm CMOS 


5V±5% 


40 MHz 


v/oiciiiiiw i i vjuau i icii^/ciwrx whii ci 

nonconductive tie bar 


Class B 


SM320C31HFGM40 


0.8-nm CMOS 


5 V ± 5% 


40 MHz 


Ceramic 132-pin quad flatpack with a 
nonconductive tie bar 


Std 


5962-9205801 MXA 


0.8-jAm CMOS 


5V±10% 


27 MHz 


141 -pin CPGA 


DESC SMD 


5962-9205801 MYA 


0.8-nm CMOS 


5V±10% 


27 MHz 


132-PIN CQFP 


DESC SMD 


5962-9205802MXA 


0.8-^m CMOS 


5V±10% 


33 MHz 


141 -pin CPGA 


DESC SMD 


5962-9205802MYA 


0.8-nm CMOS 


5V±10% 


33 MHz 


132-PIN CQFP 


DESC SMD 



SMJ 320 C 31 GFA M 27 



PREFIX 

SMJ = 883 Class B 

SM = Standard Processing 



DEVICE FAMILY 

320 = SMJ320 Family 



TECHNOLOGY 

C = CMOS 



DEVICE 

Figure 10. SMJ320C31 Device Nomenclature 
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SPEED RANGE 

27 = 27 MHz 
33 = 33 MHz 
40 = 40 MHz 



TEMPERATURE RANGE 

M = -55°C to125°C 
L = 0°C to 70°C 



PACKAGE TYPE 

GFA = 141 -Pin Staggered Pin Grid Array 
HFG = 196-Pin Quad Flatpack with a 
nonconductive tie bar 



SMJ320C3X 
DIGITAL SIGNAL PROCESSOR 

SGUS014A- FEBRUARY 1991 - REVISED SEPTEMBER 1994 



MECHANICAL DATA 
SMJ320C30 HFG 196-lead ceramic quad flat pack with a nonconductlve tie bar 



Pin 1 Indicator < 



63,63 ( 



63,12 ( 



1,55(0.061) 
4 x 0 1,50 (0.059) 



64,52 (2.540) 

63,50 (2.500) 

34,7 (1.365) 
33,7 (1.325) 

-30,5 (1.200) BSC 



TYP- 



TYP- 



0,51 (0.020) 
MAX 




196 x _ _ __v TYP 
0,18 (0.007) 



3,30 
(0.130) 
MAX 



0,20 (0.008) 



0,65 (0.025) TYP 
Ceramic Detail A 



0,10 (0.004) 

0,30 (0.012) 
0,15 (0.006) 
(at braze pads) _ 



1,02(0.040) 
0,76 (0.030) 



Detail B 



Thermal Resistance Characteristics 



PARAMETER 


°c/w 


Rejc 

R&JA 


1.3 
28.9 



2,67 (0.105) MAX 



Detail B 

ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 
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DIGITAL SIGNAL PROCESSOR 
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MECHANICAL DATA 
SMJ320C30 196-lead ceramic quad flatpack (HU suffix) 



RECOMMENDED FINAL LEAD-FORM DIMENSIONS FOR BOARD MOUNT 



r 4,19 (0.165) MAX 



Seating Plane 



BET 



. 0,38 (0.015) 
0,15 (0.006) 
With Lead Finish 




- 0,64 (0.025) TYP 



30,48 (1.200) BSC 

34,67 (1.365) _ 
33,91 (1.335) 

39,75 (1.565) 
38,99 (1.535) 



0,25 (0.010) MIN Radius 



0.004 



,1,02 (0.040) 



0,51 (0.020) 



0,36 (0.014) 
0,05 (0.002) 




0,84 (0.033) REF 



0,43 (0.017) 
0,28 (0.011) 



Radius 



0,76 (0.030) , 
0,51 (0.020) 



0,25 (0.010) 
0,10 (0.004) 
With Lead Finish 



ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 



NOTES: A. Tl does not offer MIL-SPEC part in formed lead configuration. 

B. Lead forming should be performed at customer's facility or subcontracted. 
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DIGITAL SIGNAL PROCESSOR 



SGUS01 4A- FEBRUARY 1 991 - REVISED SEPTEMBER 1 994 



MECHANICAL DATA 
SMJ320C30 181 -pin ceramic grid array (GB suffix) 



TOP VIEW 



Pin A1 
Corner Indicator 




Thermal Resistance Characteristics 



PARAMETER 


°c/w 


Rgjc 

R0JA 


1.1 

26.6 



4,70 (0.185) 
3,55(0.140) ^ 



37,6 (1.480) 



SIDE VIEW 




3,56(0.140) 
3,05 (0.120) 



2,54 (0.100) TYP 



0,406(0.016) 
(180 Places) 
TYP 



1,40(0.055) 
] J 1,14(0.045) 

— H 1,27 (0.050) NOM 



DIA (4 Places) 



35,6 (1.400) 
REF 



Index Corner 



— H I*— BOTTOM VIEW 

® ®®®®®®®®®®®®® ® 

®®®®®®®®®®®®®®® 
® ®®®®®®®®®®®®® ® 
®®®® ® ®®®® 
®®®® ®®®® 
®®®® ®®®® 
®®®®® ®®®®® 

®®®® ExtraPin ®®® ® 

® ®®® + ®®® ® 

®®®®® ® ®®®® 
®®®®®®®®®®®®®®® 
® ®®®®®®®®®®®®® ® 
® ®®®®®®®®®®®®® ®- 
©]®®®®®®®®®®®®®(j§y — r 

9 1 2 3 A 5 fi 7 ft 9 10 11 12 13 14 15 ' 



2,54 (0.100) TYP 



ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 
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DIGITAL SIGNAL PROCESSOR 

SGUS014A- FEBRUARY 1991 - REVISED SEPTEMBER 1994 



MECHANICAL DATA 
SMJ320C30 244-pin TAB frame (PG5) socket, 203 OLB/ILB 0.25- mm OLB pitch 



k P0.25 ± 0.01 x 50 = 12.75 ± 0.02 ►! 




NOTES: A. Lead pitch in OLB windows is 250 urn. 

B. OLB lead width is 1 00 urn ± 20 \im. 

C. Dimensions reference centerline to outside edge of lead. 

D. P0.25 ± 0.01 x 49 = 1 2.25 ± 0.02. 
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SMJ320C3X 
DIGITAL SIGNAL PROCESSOR 



SGUS014A- FEBRUARY 1991 - REVISED SEPTEMBER 1994 



MECHANICAL DATA 

SMJ320C30 TAB (PG5) 244-pin socket, 203 OLB/ILB 0.25-mm OLB pitch (continued) 



1404 \im \. 
(55.3 mll)_l_ k 

1016 jim J 
(40 mil) 



1016 \im 
(40 mil). 



1404 nm_^" 
(55.3 mil) 



± 



1814 urn. 
(71.4 mil) 



OLB 



ILB 



DIE 



10,34 mm 
(407 mil) 



Z 



31 



1016 jim 
(40 mil) 



1016 \xm . 
(40 mil) 



16 mm 
(630 mil) 



2,25 mm 
(89 mil) 



16 mm 
(630 mil) 



11,16 mm 
(439 mil) 



2,25 mm 
(89 mil) 



1814 (am 
| (71 .4 mil) 
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DIGITAL SIGNAL PROCESSOR 

SGUS014A- FEBRUARY 1991 - REVISED SEPTEMBER 1994 



SMJ320C30 (rev 5) Inner Lead Bond Information for TAB 
(tape automated bonding) 



Die Side Number 4 



Pad Number One 




Die Designator 



Die Side Number 



Zero-Zero 
(origin) 



Die Side Number 2 



Figure 11. SMJ320C30 Die Numbering Format 
(Reference Table 1) 

The inner lead bond (I LB) pitch for the TAB ieadframe is the same as the die bond pad pitch. Table 1 provides 
a reference for the following: 

A. The TAB lead numbers. The TAB lead numbers are the same as the die bond pad numbers. 

B. The 'C30 signal identities in relation to the pad numbers 

C. Which signal functions fan out to more than one test pad location. (There are 203 bond pad locations, 
203 TAB leads, and 244 test pad locations.) 

D. The 'C30 X,Y coordinates, where bond pad 51 serves as the origin (0,0) 

E. The I LB pitch for the TAB Ieadframe 
In addition, the following notes are significant: 

F. X,Y coordinate data is in microns. 

G. Coordinate origin is at 0,0 (center of bond pad 51). 

H. Average pitch is 186 microns (7.33 mils). 

I. Smallest pitch value is 156,8 microns (6.173 mils). 

J. The active silicon dimensions are 10224,00 fxm x 11032,00 u.m (402.52 mils x 434.33 mils). 
K. The die size is approximately 10337,80 jim x 11150,6 urn (409.00 mils x 439.00 mils). 
L Distance from diced silicon to polyimide support ring is 1016,0 \im (40 mils). 
M. Bond pad dimensions are 115,00 urn x 115,00 urn. 

N. Center of bond pad to edge of die ranges from 1 80 urn -220 ^m (7.1 mils-8.6 mils). The range of 40 urn 
exists since the dicing process will result in some tolerance. Due to the consistency and precision of the 
bond pad locations in reference to each other, the center of bond pad 51 was chosen as the origin. 
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DIGITAL SIGNAL PROCESSOR 

SGUS01 4A - FEBRUARY 1991 - REVISED SEPTEMBER 1994 



Table 1. SMJ320C30 Die Pad/Tab Lead Information : rev 5 (0,8 \im) 



DIE SIDE #1 


C30 DIE 


DIE/TAB 


TAB C30 


X COORDINATE OF 


Y COORDINATE OF 


PITCH OF LEAD (#, #) 


BOND PAD 


BOND PAD 


TEST PAD 


THE DIE BOND PAD 


THE DIE BOND PAD 


REFERENCE WHICH DIE 


LOCATIONS 


IDENTITY 


LOCATIONS 


BOND PADS 


1 

i 


pn\/ nn 

"UVDD 


1 o 




SJOOO.UU 


1 qr on h j>\ 


O 
c. 




q a 

0, H 




Q*3ft7 an 


1 Rft ftn /o ^\ 

1 OO.OU \^,oy 


o 
o 


nnn 


C 

O 




qiqq on 


1 qo nn fa a\ 


4 


FSRO 


g 




Q007 90 


1 84 00 I A 5) 


5 


CLKRO 






8823 20 


1 92 00 f5 6) 


O 


PI k'YO. 


o 




oftoi on 

OOO 1 .£U 


1 84 nn (r 7\ 

I Ot.UU ^O, / ) 


7 


COVft 

roAO 


9 




8447.20 


1 qo nn f7 q\ 


8 


UAU 


10 




oocc on 


1 oa nn /o. q\ 


y 


1 L>LI\U 


1 1 
ii 




on7i on 
out 1 .<£U 


1 qo nn iq 1 n\ 


1 n 

1U 


TfM 1^1 


1 o 




70.70 on 


1 ra nn H n 1 1 \ 


-H 
i i 


PMI Ift 


1 Q 
lO 




■7ftQ.fi; on 


1 qo nn M 1 1 o^ 


1 0 


Ynn 

AUU 








1 84 nn M o 1 ^) 


13 


XD1 


15 




7319.20 


1 92 00 M 3 1 4) 


14 


XD2 


16 




7127.20 


180 20 M4 15) 


1 5 


mnunn 


17 18 




fiQ47 nn 


1Qc on He ic\ 


16 


IUUV[JQ 


19 20 




R7C1 on 


1 88 fiO M R 1 7^ 


■j7 


XD3 


21 




ftpcq on 


1 84 nO (i 7 1 8^ 


1 8 


XD4 


22 




ft^QQ on 


1 Q2 00 M 8 1 9^ 


19 


XD5 


23 




6207.20 


1 84 00 M 9 20^ 


on 


Yn« 






ftnoq on 


1 qo nn fon 01 \ 


01 


Yn7 


OR 

&o 




cooi on 
OOO 1 xU 


184 nn foi oo\ 


oo 


yr\Q 


Oft 




«%ftA7 on 


1 qo nn /oo o^^ 


23 


XD9 


27 




rarr on 


1 84 nn 24\ 


24 


XD10 


28 


-423.80 


5271 .20 


188.20 (24,25) 


25 


V DD 


29 30 


5083 00 


1 95 20 (25 26) 


26 


Vnn 
VDD 


31 32 




4887 80 


1 56.80 (26,27) 


27 


Voo 
V SS 


3*? 34 3*5 




4731.00 


1 95.20 (27,28) 


28 


Voo 
V SS 


36, 37 




4535.80 


1 68.60 (28,29) 


29 


XD11 


38 




4367.20 


1 84.00 (29,30) 


30 


XD12 


39 




4183.20 


192.00 (30,31) 


31 


XD13 


40 




3991 .20 


184.00 (31,32) 


32 


XD14 


41 




3807.20 


192.00 (32,33) 


33 


XD15 


42 




3615 20 


184.00 (33,34) 


34 


XD16 


43 




3431.20 


192.00 (34,35) 


35 


XD17 


44 




3P39 on 


1 84 00 (3*5 36) 


oo 


AU I O 


AR 
*fO 




•anKc on 


1 9? no fafi ^7) 


Or 


AU 1 9 


Aft 




oocq on 


1 84 nn fa7 ^8) 


38 


XD20 


47 




2679.20 


1 92 00 f38 39) 


39 


XD21 


48 




2487.20 


184.00(39,40) 


40 


XD22 


49 




2303.20 


192.00(40,41) 


41 


XD23 


50 




2111.20 


184.00(41,42) 


42 


XD24 


51 




1927.20 


192.00(42,43) 


43 


XD25 


52 




1735.20 


184.00(43,44) 


44 


XD26 


53 




1551.20 


192.00(44,45) 


45 


XD27 


54 




1359.20 


184.00(45,46) 


46 


XD28 


55 




1175.20 


192.00(46,47) 


47 


XD29 


56 




983.20 


184.00(47,48) 


48 


XD30 


57 




799.20 


180.20(48,49) 


49 


IODVdd 


58, 59 




619.00 


195.20(49,50) 


50 


IODVdd 


60, 61 




423.80 
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DIGITAL SIGNAL PROCESSOR 

SGUS014A- FEBRUARY 1991 - REVISED SEPTEMBER 1994 



Table 1. SMJ320C30 Die Pad/Tab Lead Information : rev 5 (0.8 \im) (Continued) 



DIE SIDE #2 


C30 DIE 
BOND PAD 


DIE/TAB 
BOND PAD 


TAB C30 
TEST PAD 


Y COORDINATE OF 
THE DIE BOND PAD 


Y COORDINATE OF 
THE DIE BOND PAD 


PITCH OF LEAD (#, #) 
REFERENCE WHICH DIE 


LOCATIONS 


IDENTITY 


LOCATIONS 


BOND PADS 


51 


DVss 


62, 63 


0.00 




195.20(51,52) 


52 


DVss 


64 


195.2 




179.60(52, 53) 


53 


CVss 


65, 66 


374.80 




195.20(53,54) 


54 


CVss 


67 


570.00 




176.60(54, 55) 


55 


XD31 


68 


746.60 




192.00 (55,56) 


56 


A23 


69 


938.60 




200.00 (56, 57) 


57 


A22 


70 


1138.60 




200.00 (57, 58) 


58 


A21 


71 


1338.60 




192.00(58, 59) 


59 


A20 


72 


1530.60 




200.00 (59, 60) 


60 


A19 


73 


1730.60 




192.00(60, 61) 


61 


A18 


74 


1922.60 




200.00(61,62) 


62 


A17 


75 


2122.60 




200.00 (62, 63) 


63 


A16 


76 


2322.60 




192.00(63, 64) 


64 


A15 


77 


2514.36 




200.00 (64, 65) 


65 


A14 


78 


2902,80 




188.20(65, 66) 


66 


ADV DD 


79, 80 


2714.60 




195.20(66,67) 


67 


ADVqd 


81 


2902.80 




176.60(67, 68) 


68 


A13 


82 


3098.00 




200.00 (68, 69) 


69 


A12 


83 


3274.60 




192.00(69, 70) 


70 


A11 


84 


3474.60 




200.00 (70, 71) 


71 


A10 


85 


3666.60 




200.00(71,72) 


72 


A9 


86 


3866.60 




192.00(72, 73) 


73 


A8 


87 


4258.60 




200.00 (73, 74) 


74 


A7 


88 


4458.60 




192.00(74, 75) 


75 


A6 


89 


4650.60 




196.20(75, 76) 


76 


Vdd 


90, 91 


4846.80 


0.00 


195.20(76,77) 


77 


Vdd 


92, 93 


5042.00 




172.80(77, 78) 


78 


vss 


94, 95 


5214.80 




195.20(78, 79) 


79 


vss 


96, 97 


2410.00 




168.60(79, 80) 


80 


A5 


98 


5578.60 




200.00(80, 81) 


81 


A4 


99 


5778.60 




192.00(81,82) 


82 


A3 


100 


5970.60 




200.00 (82, 83) 


83 


A2 


101 


6170.60 




200.00 (83, 84) 


84 


A1 


102 


6370.60 




192.00(84, 85) 


85 


AO 


103 


6562.60 




212.20(85, 86) 


86 


EMU0 


104 


6774.80 




216.00(86, 87) 


87 


EMU1 


105 


6990.80 




208.00 (87, 88) 


88 


EMU2 


106 


7198.80 




203.80 (88, 89) 


89 


EMU3 


107 


7402.60 




204.20 (89, 90) 


90 


EMU4 


108 


7606.80 




216.00(90, 91) 


91 


MC/MP 


109 


7822.80 




203.80(91,92) 


92 


XA12 


110 


8026.60 




192.00 (92, 93) 


93 


XA11 


111 


8218.60 




200.00 (93, 94) 


94 


XA10 


112 


8418.60 




192.00(94, 95) 


95 


XA9 


113 


8610.60 




200.00 (95, 96) 


96 


XA8 


114 


8810.60 




200.00 (96, 97) 


97 


XA7 


115 


9010.60 




192.00(97, 98) 


98 


XA6 


116 


9202.60 




196.20(98, 99) 


99 


ivss 


117,118 


9398.80 




195.20(99,100) 


100 


ivss 


119 


9594.00 




164.80 (100,101) 


101 


DV S S 


120, 121 


9758.80 




195.20(101,102) 


102 


DVss 


122 


9954.00 







E-S2 
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Table 1. SMJ320C30 Die Pad/Tab Lead Information : rev 5 (0.8 ^m) (Continued) 



DIE SIDE #3 


C30 DIE 


DIE/TAB 


TAB C30 


X CUUHUINATc Up 


Y CUUHUINAI t Ur 


PITCH OF LEAD (#, #) 


BOND PAD 
LOCATIONS 


BOND PAD 
IDENTITY 


TEST PAD 
LOCATIONS 


1 PIC UIC DUNU rMU 


tup nip DAun dah 


REFERENCE WHICH DIE 
BOND PADS 


103 


ADVdd 


123, 124 




430.60 


195.20 (103,104) 


104 


ADVqd 


125, 126 




625.80 


168.60 (104,105) 


105 


XA5 


127 




764.40 


192.00(105,106) 


106 


XA4 


128 




986.40 


184.00 (106,107) 


107 


XA3 


129 




1170.40 


192.00 (107,108) 


108 


XA2 


130 




1362.40 


184.00 (108,109) 


109 


XA1 


131 




1546.40 


192.00(109,110) 


110 


XA0 


132 




1738.40 


184.00 (110,111) 


111 


D31 


133 




1922.40 


192.00 (111,112) 


112 


D30 


134 




2114.40 


184.00(112,113) 


113 


D29 


135 




2298.40 


192.00(113,114) 


114 


D28 


136 




2490.40 


184.00 (114,115) 


115 


D27 


137 




2674.40 


192.00 (115,116) 


116 


D26 


138 




2866.40 


180.20 (116,117) 


117 


DDVqd 


139, 140 




3046.60 


195.20 (117,118) 


118 


DDVdd 


141, 142 




3241 .80 


168.60 (118,119) 


119 


D25 


143 




3410.40 


184.00(119,120) 


120 


D24 


144 




3594.40 


192.00 (120,121) 


121 


D23 


145 




3786.40 


184.00 (121,122) 


122 


D22 


146 




3970.40 


192.00 (122,123) 


123 


D21 


147 




4162.40 


184.00 (123,124) 


124 


D20 


148 




4346.40 


192.00(124,125) 


125 


D19 


149 


10377.80 


4538.40 


184.00 (125,126) 


126 


D18 


150 




4722.40 


188.20 (126,127) 


127 


VDD 


151,152 




4910.60 


195.20 (127,128) 


128 


Vdd 


153,154,155 




5105.80 


156.80 (128,129) 


129 


vss 


156, 157 




5262.60 


195.20 (129,130) 


130 


vss 


158, 159 




5457.80 


168.60 (130,131) 


131 


D17 


160 




5626.40 


184.00 (131,132) 


132 


D16 


161 




5810.40 


192.00 (132,133) 


133 


D15 


162 




6002.40 


184.00 (133,134) 


134 


D14 


163 




6186.40 


192.00 (134,135) 


135 


D13 


164 




6378.40 


184.00 (135,136) 


136 


D12 


165 




6562.40 


192.00 (136,137) 


137 


D11 


166 




6754.40 


184.00 (137,138) 


138 


D10 


167 




6938.40 


192.00 (138,139) 


139 


D9 


168 




7130.40 


184.00 (139,140) 


140 


D8 


169 




7314.40 


192.00 (140,141) 


141 


D7 


170 




7506.40 


184.00 (141,142) 


142 


D6 


171 




7690.40 


192.00 (142,143) 


143 


D5 


172 




7882.40 


184.00 (143,144) 


144 


D4 


173 




8066.40 


192.00 (144,145) 


145 


D3 


174 




8258.40 


184.00 (145,146) 


146 


D2 


175 




8442.40 


192.00 (146,147) 


147 


D1 


176 




8634.40 


184.00 (147,148) 


148 


DO 


177 




8818.40 


192.00 (148,149) 


149 


H1 


178 




9010.40 


184.00(149,150) 


150 


H3 


179 




9194.40 


180.20 (150,151) 


151 


DDVdd 


180, 181 




9374.60 


195.20(151,152) 


152 


DDVdd 


182,183 




9569.80 
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Table 1. SMJ320C30 Die Pad/Tab Lead Information : rev 5 (0.8 \im) (Continued) 



DIE SIDE #4 


C30 DIE 
BOND PAD 
LOCATIONS 


DIE/TAB 
BOND PAD 
IDENTITY 


TAB C30 
TEST PAD 
LOCATIONS 


Y COORDINATE OP 
THE DIE BOND PAD 


Y COORDINATE OF 
THE DIE BOND PAD 


PITCH OF LEAD(#,#) 
REFERENCE WHICH DIE 
BOND PADS 


153 


DV S S 


184 


9947.20 




195.20 (153,154) 


154 


DV S S 


185, 186 


9752.00 




164.80(154,155) 


155 


cvss 


187 


9587.20 




195.20 (155,156) 


156 


cvss 


188, 189 


9392.00 




175.00 (156,157) 


157 


X2/CLKIN 


190 


9217.00 


9986.80 


173.20(157,158) 


158 


X1 


191 


9043.80 


9986.80 


347.80(158,159) 


159 


Vsups 


192,193 


8696.00 




160.60 (159,160) 


160 


Vbbp 


194 


8535.40 




600.00(160,161) 


161 


EMU5 


195 


7935.40 




196.00 (161,162) 


162 


XRDY 


196 


7739.40 




188.00 (162,163) 


163 


MSTRB 


197 


7551.40 




192.00(163,164) 


164 


IOSTRB 


198 


7359.40 




184.00 (164,165) 


165 


XRW 


199 


7175.40 




184.00 (165,166) 


166 


HOLDA 


200 


6991.40 




196.20(166,167) 


167 


HOLD 


201 


6795.20 




184.00 (167,168) 


168 


MDV DD 


202 


6611.20 




195.20 (168,169) 


169 


MDVqd 


203, 204 


6416.00 




172.80(169,170) 


170 


RDY 


205 


6243.20 




187.80(170,171) 


171 


STRB 


206 


6055.40 




192.00 (171,172) 


172 


R/W 


207 


5863.40 




196.20(172,173) 


173 


RESET 


208 


5667.20 




187.80(173,174) 


174 


XF1 


209 


5479.40 




184.00(174,175) 


175 


XF0 


210 


5295.40 




184.00(175,176) 


176 


IACK 


211 


5111.40 


9993.60 


196.20 (176,177) 


177 


INTO 


212 


4915.20 




184.00 (177,178) 


178 


V D D 


213, 214 


4731.20 




195.20 (178,179) 


179 


v D d 


215, 216 


4536.00 




164.80 (179,180) 


180 


v S s 


217, 218 


4371.20 




195.20(180,181) 


181 


vss 


219, 220 


4176.00 




172.80 (181,182) 


182 


INT1 


221 


4003.20 




200.00 (182,183) 


183 


INT2 


222 


3803.20 




200.00 (183,184) 


184 


INT3 


223 


3603.20 




200.00(184,185) 


185 


RSV0 


224 


3403.20 




200.00 (185,186) 


186 


RSV1 


225 


3203.20 




200.00 (186,187) 


187 


RSV2 


226 


3003.20 




208.00(187,188) 


188 


RSV3 


227 


2795.20 




200.00 (188,189) 


189 


RSV4 


228 


2595.20 




187.80 (189,190) 


190 


RSV5 


229 


2407.40 




184.00(190,191) 


191 


RSV6 


230 


2223.40 




184.00 (191,192) 


192 


RSV7 


231 


2039.40 




184.00 (192,193) 


193 


RSV8 


232 


1 855.40 




184.00 (193,194) 


194 


RSV9 


233 


1671.40 




192.00 (194,195) 


195 


RSV10 


234 


1479.40 




184.00 (195,196) 


196 


DR1 


235 


1295.40 




184.00 (196,197) 


197 


FSR1 


236 


1111.40 




184.00(197,198) 


198 


CLKR1 


237 


927.40 




184.00 (198,199) 


199 


CLKX1 


238 


743.40 




184.00 (199,200) 


200 


FSX1 


239 


559.40 




184.00(200,201) 


201 


DX1 


240 


375.40 




180.20 (201,202) 


202 


DV S S 


241,242 


195.20 




195.20(202,203) 


203 


DV S s 


243, 244 


0.00 
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MECHANICAL DATA 
SMJ320C31 132-lead nonconductive ceramic tie bar (HFG suffix) 



Pin 1 Indicator - 



51,18(2.015) 
50,55 (1.990) 



51,44(2.025) 
MAX 

24,38 (0.960) 



24,00 (0.945) 
20,45 (0.805) 



20,19(0.795) 



1,77(0.070) 
1,27(0.050) 



0,406(0.016) 
0,228 (0.009) 



0,254 (0.010) 



0,330(0.013) 
0,152 (0.006) 



Detail A 



0,120 (0.005) ^ 

0,35(0.014) 
0,05 (0.002) 
(at braze pads) 




0,13 (0.005) 
TYP 



1 



1,02 (0.040) 
0,76 (0.030) 



Detail B 



Thermal Resistance Characteristics 



PARAMETER 


°c/w 


R0JC 
R0JA 


2.1 
44.3 



2,31 (0.091) 
1,95(0.077) 



Detail B 

ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 
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MECHANICAL DATA 
SMJ320C31 132-Lead ceramic quad flatpack (HU suffix) 



RECOMMENDED FINAL LEAD-FORM DIMENSIONS FOR BOARD MOUNT 



r 3,96 (0.156) MAX 



. 0,38 (0.015) 
0,15 (0.006) 
With Lead Finish 



- 0,64 (0.025) TYP 




jL 0.004"TC 



1,02 (0.040) 
0,50 (0.020) 



29,46 (1.160) 
28,96 (1.140) 




ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 



NOTES: A. Tl does not offer MIL-SPEC part In formed lead configuration. 

B. Lead forming should be performed at customer's facility or subcontracted. 
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MECHANICAL DATA 
SMJ320C31 141 -pin ceramic pin grid array (GFA suffix) 



Pin A1 Indicator 



27,43(1.080) 
26,42(1.040) 



TOP VIEW 



Thermal Resistance Characteristics 



PARAMETER 


°c/w 


R©jc 

R0JA 


4.3 
39.0 



27,43(1.080) 



26,42(1.040) 
SIDE VIEW 



0,66 (0.026) 
0,15(0.006) 



_ 

JL 



3,56(0.140) 
3,05 (0.120) 



T 



0,508 (0.020) 
0,406 (0.016) 



|f ~~ tl 0,86(0. 
ft— 1,22(0.048)' 



3,68 (0.145) 
2,79(0.110) 



,034) TYP 
TYP 



BOTTOM VIEW 



W 



22,86 (0.900) 
REF 



Ooooooooo ®--r 

ooooooooo T 
o eeeooeee ©-J-* 

ooooooooo 
o ooooooooo 
_o o o o 

o o o o o o 

o o o o 

o o o o o o 

o o o o 

o o o o o o 

O O > — Extra Pin O O 

000/ o o o 

o o o o o— 

o ooooooooo 
ooooooooo 

o ooooooooo 

©oooooooo® 



L 2,54 (0.100) TYP 



I 



1,27 (0.050) TYP 



1,51 (0.020) REF 



Index Corner 
x 45° 1 Place 



2 4 6 8 10 12 14 16 18 1,016 (0.040) REF x 45° 3 Places 

3 5 7 9 11 13 15 17 19 



ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 
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MECHANICAL DATA 
SMJ320C31 244-pin TAB frame (PG2) socket, 132 OLB/ILB 0.30-mm OLB pitch 

k P0.30 ± 0.01 x 32 = 9.60 ± 0.02 ►! 




|* P0.30 ± 0.01 x 32 = 9.60 ± 0.02 ►) 



NOTES: A. Lead pitch in OLB windows is 300 urn. 

B. OLB lead width is 1 20 nm ± 30 urn. 

C. Dimensions reference centerline to outside edge of lead. 

D. P0.30 ± 0.01 x 32 = 9.60 ± 0.02. 



E-58 
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MECHANICAL DATA 

SMJ320C31 TAB (PG2) 244-pln socket, 132 OLB/ILB 0.30 -mm OLB pitch (continued) 



635 \im _J 
(25 mil) 



635 \im 
(25 mil). 



n 



1404 urn 

(55.3 mll)_l k 

" "I " 



1404 urn J 
(55.3 mil) 



i i 



i i 



OLB 



ILB 



DIE 



10,64 mm 
(419 mil) 



Z 



-A 



I I 
I I 



J I 



635 urn 
(25 mil) 



635 \im — [4— ►! 
(25 mil) r | 



16 mm 
(630 mil) 



1 



2,25 mm 
(89 mil) 



14 mm 
(551 mil) 



10,54 mm 
(415 mil) 



1814 urn 
I (71 .4 mil) 



Texas 
Instruments 

POST OFFICE BOX 1443 • HOUSTON, TEXAS 77251-1443 E-59 



SMJ320C3X 

DIGITAL SIGNAL PROCESSOR 

SGUS014A- FEBRUARY 1991 - REVISED SEPTEMBER 1994 



Pad Number One 



SMJ320C31 Inner Lead Bond Information for TAB 
(tape automated bonding) 

132 Die Side Number 4 100 

' BBflflBflflBflt 



Die Side Number 




Die Designator 



Zero-Zero _ 
(origin) 



Die Side Number 2 



Figure 12. SMJ320C31 Die Numbering Format 
(Reference Table 2) 

The inner lead bond (I LB) pitch for the TAB leadframe is the same as the die bond pad pitch. Table 2 provides 
a reference for the following: 

A. The TAB lead numbers. The TAB lead numbers are the same as the die bond pad numbers. 

B. The 'C31 signal identities in relation to the pad numbers 

C. Which signal functions fan out to more than one test pad location. (There are 1 32 bond pad locations, 
132 TAB leads, and 244 test pad locations.) 

D. The 'C31 X,Y coordinates, where bond pad 34 serves as the origin (0,0) 

E. The I LB pitch for the TAB leadframe 
In addition, the following notes are significant: 

F. X,Y coordinate data is in microns. 

G. Coordinate origin is at 0,0 (center of bond pad 34). 

H. Average pitch is 233 microns (11 .2 mils). 

I. Smallest pitch value is 179,6 microns (7.07 mils). 

J. The active silicon dimensions are 10215,20 um x 10324,00 um (402.17 mils x 406.46 mils). 

K. The die size is approximately 10490,20 \irr\ x 10566,40 jim (413.00 mils x 416.00 mils). 

L. Distance from diced silicon to polyimide support ring is 889,0 (35 mils). 



E-60 
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Table 2. SMJ320C31 Die Pad /TAB Lead Information : rev 2.0 (0.8 \im) 



DIE SIDE#1 


C31 DIE 


DIE/TAB 


TAB C31 


X COORDINATE OF 


Y COORDINATE OF 


PITCH OF LEAD(#,#) 


BOND PAD 


BOND PAD 


TEST PAD 


THE CENTER OF 


THE CENTER OF 


REFERENCE WHICH DIE 


LOCATIONS 


IDENTITY 


LOCATIONS 


BOND PAD 


BOND PAD 


BOND PADS 


1 


SUBS 






9649.40 


314.20 (1,2) 


2 


SHZ 






9335 20 


279 60 (2 ai 

far 9ivV IfafUl 


3 


D\/SS 






Q055 60 


278 80 ft* d\ 


4 


TCLK0 






8776 80 

Off u.uu 


270 00 (A 5\ 


5 


PVnn 






8506 80 


283 60 (5 61 


6 


TCLK1 






8223.20 


•570 on /fi 7t 
Of \o, f J 


7 


EMU3 






7ft*? 1 no 


070 40 f7 ft\ 


8 


EMU0 






7580 60 


303 20 (R Q\ 


g 


EMU1 






7277.40 


300 80 f9 1 0^ 


10 


EMU2 






6Q76 60 


240 00 M 0 1 1 \ 


11 


MCBL/MP 






6736.60 


342.60 (11,12) 


12 








6394 00 


203 00 M 2 1 S\ 


13 


A23 






6191.00 


285.60 (13,14) 


14 


A22 






5895 40 


330 80 fl 4 1 55 


15 


V UUL 






5564.60 


180.40 (15,16) 


16 


Vnni 
V UUL 






5984.20 


397.40 (16,17) 


17 


A21 




- 484.80 


4986.80 


282.00 (17,18) 


18 


A20 






4704.80 


338.00 (18,19) 


19 


VsSL 






4366.80 


1 80.40 (1 9,20) 


20 


DVqq 






4186.40 


322.60 (20,21) 


21 


A19 






3863.80 


277.40 (21 ,22) 


22 


AV D D 






3586.40 


295.60 (22^23) 


23 


A18 






3290.80 


276.20 (23,24) 


24 


A17 






3014.60 


290.20 (24,25) 


25 


A16 






2724.40 


267.00 (25,26) 


26 


A15 






2457.40 


284.80 (26,27) 


27 


A14 






2172.60 


346.60 (27,28) 


28 


A13 






1826.00 


276.00 (28,29) 


29 


A12 






1550.00 


278.20 (29,30) 


30 


A11 






1271.80 


282.80 (30,31) 


31 


av D d 






989.00 


273.80(31,32) 


32 


A10 






715.20 


274.20 (32,33) 


33 


cv S s 






441.00 
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Table 2. SMJ320C31 Die Pad /TAB Lead Information : rev 2.0 (0.8 urn) (Continued) 



DIE SIDE #2 



C31 DIE 
BOND PAD 
LOCATIONS 



DIE/TAB 
BOND PAD 
IDENTITY 



TAB C31 
TEST PAD 
LOCATIONS 



X COORDINATE OF 
THE CENTER OF 
BOND PAD 



Y COORDINATE OF 
THE CENTER OF 
BOND PAD 



PITCH OF LEAD(#,#) 
REFERENCE WHICH DIE 
BOND PADS 



34 
35 
36 
37 
38 
39 
40 
41 
42 
43 
44 
45 
46 
47 
48 
49 
50 
51 
52 
53 
54 
55 
56 
57 
58 
59 
60 
61 
62 
63 
64 
65 
66 



A9 

DysS 
A8 
A7 
A6 
A5 

AV DD 
A4 
A3 
A2 
A1 
AO 

cvss 

D31 

Vddl 

* UUL 

D30 
V SSL 
VSSL 
DV S S 

D29 

D28 
DV DD 

D27 

•vss 

D26 
D25 
D24 
D23 
D22 
D21 
DV DD 
D20 



0.00 
300.00 
569.20 
843.80 
1137.00 
1415.60 
1710.80 
1974.00 
2251.40 
2536.40 
2809.80 
3108.20 
3406.00 
3662.80 
3983.60 
41 64.00 
4457.80 
4821.40 
5001.40 
5316.80 
5594.80 
5873.20 
6193.40 
6543.20 
6796.40 
7102.20 
7374.40 
7659.60 
7947.40 
8237.80 
8496.60 
8788.20 
9012.40 



0.00 



300.00 (34, 35) 
269.20 (35, 36) 
274.60 (36, 37) 
293.20 (37, 38) 
278.60 (38, 39) 
295.20 (39, 40) 
263.20(40, 41) 
277.40(41,42) 
285.00 (42, 43) 
273.40 (43, 44) 
298.40 (44, 45) 
297.80 (45, 46) 
256.80 (46, 47) 
320.80 (47, 48) 
180.40(48,49) 
293.80 (49, 50) 
180.00 (51,52) 
315.40(52, 53) 
278.00 (53, 54) 
278.40 (54, 55) 
320.20 (55, 56) 
349.80 (56, 57) 
253.20 (57, 58) 
305.80 (58, 59) 
272.20 (59, 60) 
285.20 (60, 61) 
287.80(61,62) 
290.40 (62, 63) 
258.80 (63, 64) 
291.60(64, 65) 
224.20 (65, 66) 
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Table 2. SMJ320C31 Die Pad /TAB Lead Information : rev 2.0 (0.8 \im) (Continued) 



DIE SIDE #3 



C31 DIE 
BOND PAD 
LOCATIONS 



DIE/TAB 
BOND PAD 
IDENTITY 



TAB C31 
TEST PAD 
LOCATIONS 



X COORDINATE OF 
THE CENTER OF 
BOND PAD 



Y COORDINATE OF 
THE CENTER OF 
BOND PAD 



PITCH OF LEAD(#,#) 
REFERENCE WHICH DIE 
BOND PADS 



67 
68 
69 
70 
71 
72 
73 
74 
75 
76 
77 
78 
79 
80 
81 
82 
83 
84 
85 
86 
87 
88 
89 
90 
91 
92 
93 
94 
95 
96 
97 
98 
99 



DV S S 
D19 
D18 
D17 
D16 
D15 

cv S s 

D14 
DV DD 
D13 

ivss 

D12 
D11 
D10 
VDDL 

Vddl 

D9 

D8 
DV S S 
VSSL 
VSSL 

D7 

D6 
DV DD 

D5 

D4 

D3 

D2 

D1 

DO 

H1 

H3 
DVDD 



9780.40 



508.60 

861.20 

1142.00 

1414.00 

1682.80 

1926.00 

2301.60 

2514.00 

2828.00 

3035.60 

3436.20 

3650.80 

3919.60 

4213.20 

4556.60 

4736.20 

5051.60 

5333.20 

5618.40 

5958.40 

6138.80 

6428.40 

6714.80 

7012.60 

7279.60 

7560.40 

7842.80 

8127.60 

8403.60 

8689.20 

8979.60 

9254.00 

9631.20 



352.60 (67, 68) 
280.80(70, 71) 
272.00 (69, 70) 
268.80(70, 71) 
243.20(71,72) 
375.60 (72, 73) 
212.40 (73,74) 
314.00(74, 75) 
207.60 (75, 76) 
400.60 (76, 77) 
214.60(77, 78) 
268.80 (78, 79) 
293.60 (79, 80) 
343.40(80, 81) 
179.60(81,82) 
315.40(82, 83) 
281.60(83,84) 
285.20 (84, 85) 
340.00 (85, 86) 
180.40(86, 87) 
289.60 (87, 88) 
286.40 (88, 89) 
297.80 (89, 90) 
267.00(90, 91) 
280.80(91,92) 
282.40 (92, 93) 
284.80 (93, 94) 
276.00 (94, 95) 
285.60 (95, 96) 
290.40 (96, 97) 
274.40 (97, 98) 
377.20 (98, 99) 
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Table 2. SMJ320C31 Die Pad /TAB Lead Information : rev 2.0 (0.8 \im) (Continued) 



DIE SIDE #4 


V/Ol UIC 


nic /tar 

UIC./ IAD 


TAB f*«M 
IAD IrfJl 


v r^nnoniM ate oc 
A UUUKUIIMAI C Ur 


Y VsUUnUIIMAI C UP 


DITS*U f\C 1 B An t# Jt\ 

rl 1 Un Ur LcAU (JF, n ) 


nnkin DAn 

DUIMU rAU 


no kin DAn 
DUIMU rAU 


TCCT DAn 

1 to I rAU 


TUB f*EKITED f\XS 

Trie UclM 1 erf Ur 


TUB f*CMTED OB 


DCECDCM^C 1A/UIS*U niB 

ncrbKcNUb Wniun Ulc 


LvwH 1 IvsPiO 


IDENTITY 


1 OCATIOMQ 
LwV/AI IwlMO 


□OKin PAD 

DUNU rAU 


DAun PAD 
ounu rAU 


DOMn PA DC 


100 


DVss 




9032.60 




210.40 (100,101) 


101 


CVss 




8822.20 




280.00 (101, 102) 


102 


ivss 




8543.20 




301.80(102, 103) 


103 


\j>r% 1 S^l LSI ft. 1 

X2/CLKIN 




8240.40 




186.20 (103, 104) 


104 


X1 




8054.20 




311.40 (104, 105) 


105 


HOLDA 




7742.80 




282.80 (105, 106) 


106 


HOLD 




7460.00 




293.00 (106, 107) 


107 


CVdq 




7167.00 




431.00 (107, 108) 


108 


RDY 




6736.00 




276.80 (108, 109) 


109 


STRB 




6459.20 




268.00(109, 110) 


110 


R/W 




6191.20 




295.20 (110, 111) 


111 


RESET 




5896.00 




278.40 (111, 112) 


112 


XF0 




5617.60 




266.60 (112, 113) 
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Appendix F 



Analog Interface Peripherals and 

Applications 



Texas Instruments (Tl) offers many products for total system solutions, includ- 
ing memory options, data acquisition, and analog input/output devices. This 
appendix describes a variety of devices that interface directly to the TMS320 



DSPs in rapidly expanding applications. 

Major topics discussed in this appendix are listed below. 

Topic Page 

F.1 Multimedia Applications F-2 

F.2 Telecommunications Applications F-5 

E3 Dedicated Speech Synthesis Applications F-11 

F.4 Servo Control/Disk Drive Applications F-1 4 

F.5 Modem Applications F-1 7 

F.6 Advanced Digital Electronics Applications for Consumers ...... F-20 
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FA Multimedia Applications 

Multimedia integrates different media through a centralized computer. These 
media can be visual or audio and can be input to or output from the central 
computer via a number of technologies. The technologies can be digital-based 
or analog-based (such as audio or video tape recorders). The integration and 
interaction of media enhance the transfer of information and can accommo- 
date both analysis of problems and synthesis of solutions. 

Figure F-1 shows both the central role of the multimedia computer and the 
multimedia system's ability to integrate the various media to optimize informa- 
tion flow and processing. 



Figure F-1. System Block Diagram 
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F.1.1 System Design Considerations 

Multimedia systems can include various grades of audio and video quality. The 
most popular video standard currently used (VGA) covers 640 x 480 pixels 
with 1 , 2, 4, and 8-bit memory-mapped color. Also, 24-bit true color is sup- 
ported, and 1 024 x 768 (beyond VGA) resolution has emerged. There are two 
grades of audio. The lower grade accommodates 1 1 .25-kHz sampling for 8-bit 
monaural systems, while the higher grade accommodates 44.1 -kHz sampling 
for 16-bit stereo. 

Audio specifications include a musical instrument digital interface (MIDI) with 
compression capability, which is based on keystroke encoding, and an input/ 
output port with a three-disc voice synthesizer. In the media control area, video 
disc, CD audio, and CD ROM player interfaces are included. Figure F-2 
shows a multimedia subsystem. 
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The TLC32047 wide-band analog interface circuit (AIC) is well suited for multi- 
media applications because it features wide-band audio and up to 25-kHz 
sampling rates. The TLC32047 is a complete analog-to-digital and digital-to- 
analog interface system for the TMS320 DSPs. The nominal bandwidths of the 
filters accommodate 11.4 kHz, and this bandwidth is programmable. The 
application circuit shown in Figure F-2 handles both speech encoding and 
modem communication functions, which are associated with multimedia appli- 
cations. 



Figure F-2. Multimedia Speech Encoding and Modem Communication 
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Figure F-3 shows the interfacing of the TMS320C25 DSP to the TLC32047 
AIC, which constitutes a building block of the 9600-bps V.32 bis modem shown 
in Figure F-2. 



Figure F-3. TMS320C25 to TLC32047 Interface 
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F.1.2 Multimedia-Related Devices 

As shown in Table F-1 and Table F-2, Tl provides a complete array of analog 
and graphics interface devices. These devices support the TMS320 DSPs for 
complete multimedia solutions. 



Table F-1. Data Converter ICs 





uosci ipuun 


I/O 


Resolution 


Conversion 

r*l If Data 

ULl\ Kale 


A vM>li nation 

Mppiiuaiiun 


TLC320AC01 


Analog interface (5 V only) 


Serial 


14 


43.2 kHz 


Portable modem and 
speech, multimedia 


TLC32047 


Analog interface 

HI vl WlAy Q\AA /A|p\ 


Serial 


14 


25 kHz 


Speech, modem, and 

mi iltimoHia 
8 1 lUllll IIUUlCl 


TLC32046 


Ana Inn interface SAIf^ 


Serial 


14 


25 kHz 


Sneerh anH modems 


TLC32044 


Anaion interface fAIC^ 


Serial 


14 


19.2 kHz 


Sneerh and modems 


TLC32040 


Analoa interface fAIC^ 


Serial 


14 


19.2 kHz 


Sneeeh and modems 

UpCCwl 1 Cll IVd 1 1 IwU wl 1 IO 


TLC34075/6 


Video nalette 


Parallel 


Trinle 8 


135 MHz 

1 SJyJ 1 VI 1 1 i- 


fnranhio^ 


TLC34058 


Video palette 


Parallel 


Triple 8 


135 MHz 


Graphics 


TLC5502/3 


Flash ADC 


Parallel 


8 


20 MHz 


Video 


TLC5602 


Video DAC 


Parallel 


8 


20 MHz 


Video 


TLC5501 


Flash ADC 


Parallel 


6 


20 MHz 


Video 


TLC5601 


Video DAC 


Parallel 


6 


20 MHz 


Video 


TLC1 550/1 


ADC 


Parallel 


10 


150 kHz 


Servo Ctrl / speech 


TLC32071 


Analog interface (AIC) 


Parallel 


8 


1 MHz 


Servo Ctrl / disk drive 


TMS57013/4 


Dual audio DAC + digital 
filter 


Serial 


16/18 


32, 37.8, 
44.1,48 kHz 


Digital audio 



Table F-2. Switched-Capacitor Filter ICs 



Device Function Order Roll-Off Power Out Power Down 



TLC2470 


Differential audio filter amplifier 


4 


5 kHz 


500 mW 


Yes 


TLC2471 


Differential audio filter amplifier 


4 


3.5 kHz 


500 mW 


Yes 


TLC10/20 


General-purpose dual filter 


2 


CLK + 50 
CLK +100 


N/A 


No 


TLC04/14 


Low pass, Butterworth filter 


4 


CLK + 50 
CLK +100 


N/A 


No 



For application assistance or additional information, please call Tl Linear 
Applications at (214) 997-3772. 
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F.2 Telecommunications Applications 

The Tl linear product line focuses on three primary telecommunications appli- 
cation areas: 

□ Subscriber instruments (telephones, modems, etc.) 

Includes the TCM508x DTMF tone encoder family, the TCM150x tone 
ringer family, the TCM1 520 ring detector, and the TCM31 05 FSK modem. 

□ Central office line card products 

Includes the TCM29Cxx combo (combined PCM filter plus codec) family, 
the TCM420x subscriber line control circuit family, and the TCM1 030/60 
line card transient protector. 

□ Personal communications products 

Includes the TCM320AC3x family of 5-volt voice-band audio processors 
(VBAP). 

Tl continues to develop new telecom integrated circuits, such as a high-perfor- 
mance three-volt combo family for personal communications applications and 
an RF power amplifier family for hand-held and mobile cellular phones. 

System Design Considerations. The size, network complexity, and com- 
patibility requirements of telecommunications central office systems create 
demanding performance requirements. Combo voice-band filter performance 
is typically ± 0.1 5 dB in the passband. Idle channel noise must be on the order 
of 15 dBrncO. Gain tracking (S/Q) and distortion must also meet stringent re- 
quirements. The key parameters for a SLIC device are gain, longitudinal bal- 
ance, and return loss. 
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Figure F-4. Typical DSP/Combo Interface 
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The TCM320AC36 combo interfaces directly to the TMS320C25 serial port 
with a minimum of external components, as shown in Figure F-4. Half of hex 
inverter U3 and crystal Y1 form an oscillator that provides clock timing to the 
TCM320AC36. The synchronous four-bit counters U1 and U2 generate an 
8-kHz frame sync signal. DCLKR on the TCM320AC36 is connected to Vqd, 
placing the combo in fixed data-rate mode. Two 20-kQ resistors connected to 
ANLGIN and MIC_GS set the gain of the analog input amplifierto 1 . The timing 
is shown in Figure F-5. 
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Figure F-5. DSP/Combo Interface Timing 
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Telecommunications-Related Devices. Data sheets for the devices in 
Table F-3 on page F-8 are contained in the 1991 Telecommunications Cir- 
cuits Databook (literature number SCTD001 B). To request your copy, contact 
your nearest Tl field sales office or call the Literature Response Center at (800) 
477-8924. 



Analog Interface Peripherals and Applications F-7 



Telecommunications Applications 



Table F-3. Telecom Devices 



Device Number 


Coding 
Law 


Clock Rates 
MHzt 


# of Bits 


Comments 


Codec/Filter 


TCM29C13 


A and n 


1 .544, 1 ,536, 2.048 


8 


CO. and PBX line cards 


TCM29C14 


A and n 


1 .544, 1 .536, 2.048 


8 


Includes 8th-bit signal 


TCM29C16 


u 


2.048 


8 


1 6-pin package 


TCM29C17 


A 


2.048 


8 


1 6-pin package 


TCM29C18 


u 


2.048 


8 


Low-cost DSP interface 


TCM29C19 


U 


1.536 


8 


Low-cost DSP interface 


TCM29C23 


A and n 


Up to 4.096 


8 


Extended frequency range 


TCM29C26 


A and i* 


Up to 4.096 


8 


Low-power TCM29C23 


TCM320AC36 


\i and Linear 


Up to 4.096 


8 and 13 


Single voltage (+5) VBAP 


TCM320AC37 


A and Linear 


Up to 4.096 


8 and 13 


Single voltage (+5) VBAP 


TCM320AC38 


n and Linear 


Up to 4.096 


8 and 13 


Single voltage (+5) GSM 


TCM320AC39 


A and Linear 


Up to 4.096 


8 and 13 


Single voltage (+5) GSM 


TP3054/64 




1.544, 1.536, 2.048 


8 


National Semiconductor 
second source 


TP3054/67 


A 


1.544, 1.536, 2.048 


8 


National Semiconductor 
second source 


TLC320AC01 


Linear 


43.2 kHz 


14 


5-volt-only analog interface 


TLC32040/1 


Linear 


Up to 19.2-kHz sampling 


14 


For high-dynamic linearity 


TLC32044/5 


Linear 


Up to 1 9.2-kHz sampling 


14 


For high-dynamic linearity 


TLC32046 


Linear 


Up to 25-kHz sampling 


14 


For high-dynamic linearity 


TLC32047 


Linear 


Up to 25-kHz sampling 


14 


For high-dynamic linearity 


Transient Suppressor 


TCM1030 


Transient suppressor for SLIC-based line card 




(30 A max) 


TCM1060 


Transient suppressor for SLIC-based line card 




(60 A max) 



T Unless otherwise noted 
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Table F-4 is a list of switched-capacitor filter ICs. 
Table F-4. Switched-Capacitor Filter ICs 



Device 


Function 


Order 


Roll-Off 


Power Out 


Power Down 


TLC2470 


Differential audio filter amplifier 


4 


5 kHz 


500 mW 


Yes 


TLC2471 


Differential audio filter amplifier 


4 


3.5 kHz 


500 mW 


Yes 


TLC10/20 


General-purpose dual filter 


2 


CLK + 50 
CLK+100 


N/A 


No 


TLC04/14 


Low pass, Butterworth filter 


4 


CLK + 50 
CLK+100 


N/A 


No 



For further information on these telecommunications products, please call 
(214)997-3772. 

Figure F-6 and Figure F-7 show telecom applications. 
Figure F-6. General Telecom Applications 
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Figure F-7. Generic Telecom Applications 
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F.3 Dedicated Speech Synthesis Applications 

For dedicated speech synthesis applications, Tl offers a family of dedicated 
speech synthesizer chips. This speech technology has been used in a wide 
range of products, including games, toys, burglar alarms, fire alarms, automo- 
biles, airplanes, answering machines, voice mail, industrial control machines, 
office machines, advertisements, novelty items, exercise machines, and 
learning aids. 

Dedicated speech synthesis chips are a good alternative for low-cost applica- 
tions. The speech synthesis technology provided by the dedicated chips is ei- 
ther linear-predictive coding (LPC) or continuously variable slope delta modu- 
lation (CVSD). Table F-5 shows the characteristics of the Tl voice synthesiz- 
ers. 



Table F-5. Tl Voice Synthesizers 


Device 


Microprocessor 


Synthesis 
Method 


I/O Pins 


On-Chip 
Memory 
(Bits) 


External 
Memory 


Data Rate 
(Bits/Sec) 


TSP50C4X 


8-bit 


LPC-10 


20/32 


64K/128K 


VROM 


1200-2400 


TSP50C1X 


8-bit 


LPC-12 


10 


64K/128K 


VROM 


1200-2400 


TSP53C30 


8-bit 


LPC-10 


20 


N/A 


From host \xP 


1200-2400 


TSP50C20 


8-bit 


LPC-10 


32 


N/A 


EPROM 


1200-2400 


TMS3477 


N/A 


CVSD 


2 


None 


DRAM 


16K-32K 



In addition to the speech synthesizers, Tl has low-cost memories that are ideal 
for use with these chips. Tl can also be of assistance in developing and pro- 
cessing the speech data that is used in these speech synthesis systems. 
Table F-6 shows speech memory devices of different capabilities. Additional- 
ly, audio filters are outlined in Table F-7. 
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Table F-6. Speech Memories 

TSP60Cxx Family of Speech ROMs 



Family Size No. of Pins Interface For use with: 



TSP60C18 


256K 16 




Parallel 4-bit 


TSP50C1X 


TSP60C19 


256K 16 




Serial 




TSP50C4X 


TSP60C20 


256K 28 




Parallel/serial 
8-bit 


TSP50C4X 


TSP60C80 


1M 28 




Serial 




TSP50C4X 


TSP60C81 


1M 28 




Parallel 4-bit 


TSP50C1X 


Table F-7. Switched-Capacitor Filter ICs 


r\-. -i — - 
lsc vi we 


Function 


Order 


Roil-Off 


Power Out Power Down 


TLC2470 


Differential audio filter amplifier 


4 


5 kHz 


500 mW 


Yes 


TLC2471 


Differential audio filter amplifier 


4 


3.5 kHz 


500 mW 


Yes 


TLC10/20 


General-purpose dual filter 


2 


CLK + 50 
CLK +100 


N/A 


No 


TLC04/14 


Low pass, Butterworth filter 


4 


CLK + 50 
CLK+100 


N/A 


No 
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Table F-8 lists some of Tl's speech synthesis development tools. 
Table F-8. Speech Synthesis Development Tools 



Name 


Definition 


(a) Software 


EVM 


Code development tool 


(b) Speech 


SAB 


Speech audition board 


SD85000 


PC-based speech analysis system 


(c) System 


SEB 


System emulator board 


SEBGOCxx 


System emulator boards for speech memories 



For further information, call Linear Applications at (214) 997-3772. 
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F.4 Servo Control/Disk Drive Applications 

In the past, most servo control systems used only analog circuitry. However, 
the growth of digital signal processing (DSP) has made digital control theory 
a reality. Figure F-8 is a block diagram of a generic digital control system using 
a DSP, along with an analog-to-digital converter (ADC) and a digital-to-analog 
converter (DAC). 

Figure F-8. Generic Servo Control Loop 
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In a DSP-based control system, the control algorithm is implemented via soft- 
ware. No component aging or temperature drift is associated with digital con- 
trol systems. Additionally, sophisticated algorithms can be implemented and 
easily modified to upgrade system performance. 

System Design Considerations 

TMS320 DSPs have facilitated the development of high-speed digital servo 
control for disk drive and industrial control applications. In recent years, disk 
drives have increased storage capacity from 5 megabytes to over 1 gigabyte. 
This equates to a 23,900 percent growth in capacity. To accommodate these 
increasingly higher densities, the data on the servo platters, whether servo-po- 
sitioning or actual storage information, must be converted to digital electronic 
signals at increasingly closer points in relation to the platter pick-off point. The 
ADC must have increasingly higher conversion rates and greater resolution 
to accommodate the increasing bandwidth requirements of higher storage 
densities. In addition, the ADC conversion rates must increase to accommo- 
date the shorter data retrieval access time. 
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Figure F-9 is a block diagram of a disk drive control system. 
Figure F-9. Disk Drive Control System Block Diagram 
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Table F-9 lists analog/digital interface devices used for servo control. 
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Table F-9. Control-Related Devices 



Function 



Device 



Bits 



Speed 



Channels Interface 



ADC 



nAr. 



AIC 



TLC1550 

TLC1551 

TLC5502/3 

TLC0820 

TLC1225 

TLC1558 

TLC1543 

TLC1549 

TLC7524 

TLC7628 

TLC5602 

TLC32071 



10 

10 

8 

8 

13 

10 

10 

10 

8 

8 

8 

8 (ADC) 



3-5 \xs 

3-5 [is 

50 ns (flash) 

1.5 jxs 

12 \is 

3-5 jxs 

21 us 

21 [is 

9 MHz 

9 MHz 

30 MHz 

1 [is 
9 MHz 



(DHL) 



(Dual) 



Parallel 

Parallel 

Parallel 

Parallel 

Parallel 

Parallel 

Serial 

Serial 

Parallel 

Parallel 

Parallel 

Parallel 



Figure F-10 shows the interfacing of the TMS320C14 and the TLC32071 . 
Figure F-10. TMS320C14-TLC32071 Interface 
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For further information on these servo control products, please call Tl Linear 
Applications at (214) 997-3772. 
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F.5 Modem Applications 

High-speed modems (9,600 bps and above) require a great deal of analog sig- 
nal processing in addition to digital signal processing. Designing both high- 
speed capabilities and slower fall-back modes poses significant engineering 
challenges. Tl offers a number of analog front-end (AFE) circuits to support 
various high-speed modem standards. 

The TLC32040, TLC32044, TLC32046, TLC32047, and TLC320AC01 AlCs 
are especially suited for modem applications by the integration of an input mul- 
tiplexer, switched capacitor filters, high resolution 14-bit ADC and DAC, a four- 
mode serial port, and control and timing logic. These converters feature ad- 
justable parameters, such as filtering characteristics, sampling rates, gain se- 
lection, (sin x)/x correction (TLC32044, TLC32046, and TLC32047 only), and 
phase adjustment. All of these parameters are software-programmable, mak- 
ing the AIC suitable for a variety of applications. Table F-1 0 has the descrip- 
tion and characteristics of these devices. 



Table F-1 0. Modem AFE Data Converters 



Device 


Description 


i/O 


Resolution 
(Bits) 


Conversion 
Rate 


TLC32040 


Analog interface chip (AIC) 


Serial 


14 


19.2 kHz 


TLC32041 


AIC without on-board Vref 


Serial 


14 


19.2 kHz 


TLC32044 


Telephone speed/modem AIC 


Serial 


14 


19.2 kHz 


TLC32045 


Low-cost version of the TLC32044 


Serial 


14 


19.2 kHz 


TLC32046 


Wide-band AIC 


Serial 


14 


25 kHz 


TLC32047 


AIC with11.4-kHz BW 


Serial 


14 


25 kHz 


TLC320AC01 


5-volt-only AIC 


Serial 


14 


43.2 kHz 


TCM29C18 


Companding codec/filter 


PCM 


8 


8 kHz 


TCM29C23 


Companding codec/filter 


PCM 


8 


16 kHz 


TCM29C26 


Low-power codec/filter 


PCM 


8 


16 kHz 


TCM320AC36 


Single-supply codec/filter 


PCM and 
Linear 


8 


25 kHz 
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The AIC interfaces directly with serial-input TMS320 DSPs, which execute the 
modem's high-speed encoding and decoding algorithms. The TLC320C4x 
family performs level-shifting, filtering, and A/D and D/A data conversion. The 
DSP's software-programmable features provide the flexibility required for mo- 
dem operations and make it possible to modify and upgrade systems easily. 
Under DSP control, the AlC's sampling rates permit designers to include fall- 
back modes without additional analog hardware in most cases. Phase adjust- 
ments can be made in real time so that the A/D and D/A conversions can be 
synchronized with the upcoming signal. In addition, the chip has a built-in loop- 
back feature to support modem self-test requirements. 

For further information or application assistance, please call Tl Linear Applica- 
tions at (214) 997-3772. 

Figure F— 11 shows a V.32 bis modem implementation using the TMS320C25 
and a TLC320AC01 . The upper TMS320C25 performs echo cancellation and 
transmit data functions, while the lower TMS320C25 performs receive data 
and timing recovery functions. The echo canceler simulates the telephone 
channel and generates an estimated echo of the transmit data signal. 

Figure F-11. High-Speed V.32 Bis and Multistandard Modem With the TLC320AC01 AIC 
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The TLC320AC01 performs the following functions: 
□ Upper TLC320AC01 D/A Path 

Converts the estimated echo, as computed by the upper TMS320C25, into 
an analog signal, which is subtracted from the receive signal 

Upper TLC320AC01 A/D Path 

Converts the residual echo to a digital signal for purposes of monitoring 
the residual echo and continuously training the echo canceler for optimum 
performance. The converted signal is sent to the upper TMS320C25. 



□ 
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□ Lower TLC320AC01 D/A Path 

Converts the upper TMS320C25 transmit output to an analog signal, per- 
forms a smoothing filter function, and drives the DAC 

□ Lower TLC320AC01 A/D Path 

Converts the echo-free receive signal to a digital signal, which is sent to 
the lower TMS320C25 to be decoded 



Note: Modem Functions 

Figure F-1 1 is for illustration only. In reality, one single TMS320C5x DSP can 
implement high-speed modem functions. 

■ — i 
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F.6 Advanced Digital Electronics Applications for Consumers 

With the extensive use of the TMS320 DSPs in consumer electronics, much 
electromechanical control and signal processing can be done in the digital do- 
main. Digital systems generally require some form of analog interface, usually 
in the form of high-performance ADCs and DACs. Figure F-1 2 shows the gen- 
eral performance requirements for a variety of applications. 



Figure F-1 2. Applications Performance Requirements 
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Advanced Television System Design Considerations. Advanced 
Digital Television (ADTV) is a technology that uses DSP to enhance video and 
audio presentations and to reduce noise and ghosting. Because of these DSP 
techniques, a variety of features can be implemented, including frame store, 
picture-in-picture, improved sound quality, and zoom. The bandwidth require- 
ments remain at the existing six-M Hz television allocation. From the intermedi- 
ate frequency (IF) output, the video signal is converted by an eight-bit video 
ADC. The digital output can be processed in the digital domain to provide noise 
reduction, interpolation or averaging for digitally increased sharpness, and 
higher quality audio. The DSP digital output is converted back to analog by a 
video DAC, as shown in Figure F-1 3. 
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Figure F-13. Video Signal Processing Basic System 
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Video casette recorders (VCRs), compact disc (CD) and digital audio tape 
(DAT) players, and personal computers (PCs) are a few of the products that 
have taken a major position in the marketplace in recent years. The audio 
channels for compact disc and DAT require 1 6-bit A/D resolution to meet the 
distortion and noise standards. See Figure F-1 4 for a block diagram of a typi- 
cal digital audio system. 



Figure F-1 4. Typical Digital Audio Implementation 
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The motion and motor control systems usually use 8- to 10-bit ADCs for the 
lower frequency servo loop. Tape or disk systems use motor or motion control 
for proper positioning of the record or playback heads. With the storage me- 
dium compressing data into an increasingly smaller physical size, the position- 
ing systems require more precision. 

The audio processing becomes more demanding as higher fidelity is required. 
Better fidelity translates into lower noise and distortion in the output signal. 

The TMS5701 3DW/5701 4DW one-bit DACs include an eight-times-over sam- 
pling digital filter designed for digital audio systems, such as compact disk 
players (CDPs), DATs, compact disks interactive (CDIs), laser disk players 
(LDPs), digital amplifiers, and car stereos. They are also suitable for all sys- 
tems that include digital sound processing like TVs, VCRs, musical instru- 
ments, multimedia, etc. 

The converters have dual channels so that the right and left stereo signals can 
be transformed into analog signals with only one chip. There are some func- 
tions that allow the customers to select the conditions according to their appli- 
cations, such as muting, attenuation, de-emphasis, and zero data detection. 
These functions are controlled by external 1 6-bit serial data from a controller 
like a microcomputer. 

The TMS5703DW/5701 4DW adopt 1 29-tap finite impulse response (Fl R) filter 
and third-order A 2 modulation to get -75-dB stop band attenuation and 96-dB 
signal noise ratio (SNR). The output is pulse width modulation (PWM) wave, 
which facilitates analog signals through a low-pass filter. 

Table F— 11 lists Tl products for analog interfacing to digital systems. 



F-22 



Advanced Digital Electronics Applications for Consumers 



Table F-11. Audio/Video Analog/Digital Interface Devices 



Function 


Device 


Bits 


Speed 


Channels 


Interface 


Dual audio DAC + digital filter 


TMS57013/4 


16/18 


32, 37.8, 
44.1, 48 kHz 


2 


Serial 


Analog interface 

A/D 

D/A 


TLC32071 


8 
8 


2 t*s 
15 \is 


8 

■j 


Parallel 
Parallel 

1 wll will Wl 


A/D 


TLC1225 


12 


12 




Parallel 


A/D 


TLC1 550 


10 


6 ja,s 




Parallel 

1 wll wlllvsl 


Virion H/A 




ft 


\J\J l lO 


1 


Parallpl 

1 Cll CUIwl 


Video D/A 


TL5602 


8 


50 ns 


1 


Parallel 


Triple video D/A 


TL5632 


8 


16 ns 


3 


Parallel 


Triple flash A/D 


TLC5703 


8 


70 ns 


3 


Parallel 


Flash A/D 


TLC5503 


8 


100 ns 


1 


Parallel 


Flash A/D 


TLC5502 


8 


50 ns 


1 


Parallel 



For further information or application assistance, please call TI Linear Applica- 
tions at (214) 997-3772. 
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This appendix contains the source code for the TMS320C3x boot loader. 
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************************************************************************ 

* C31B00T - TMS320C31 BOOT LOADER PROGRAM 

* (C) COPYRIGHT TEXAS INSTRUMENTS INC., 1990 
* 

* NOTES 1. AFTER DEVICE RESET, THE PROGRAM IS SET TO WAIT FOR 

* THE EXTERNAL INTERRUPTS. THE FUNCTION SELECTION OF 

* THE EXTERNAL INTERRUPTS IS AS FOLLOWS; 

* , 

* INTERRUPT PIN | FUNCTION 

* I 

* 0 | EPROM boot loader from 100 OH 

* I 

* 1 | EPROM boot loader from 400000H 

* I 

* 2 j EPROM boot loader from FFF000H 

* | 

* 3 | Serial port 0 boot loader 

* 2. THE EPROM BOOT LOADER LOADS WORD, HALFWORD, OR BYTE- 

* WIDE PROGRAMS TO SPECIFIED LOCATIONS. THE 

* 8 LSBS OF FIRST MEMORY SPECIFY THE MEMORY WIDTH OF 

* THE EPROM. IF THE HALFWORD OR BYTE-WIDE PROGRAM IS 

* SELECTED, THE LSBs ARE LOADED FIRST, FOLLOWED BY THE MSBs . 

* THE FOLLOWING WORD CONTAINS THE CONTROL WORD FOR 

* THE LOCAL MEMORY REGISTER. THE PROGRAM BLOCKS FOLLOW. 

* THE FIRST TWO WORDS OF EACH PROGRAM BLOCK CONTAIN 

* THE BLOCK SIZE AND MEMORY ADDRESS TO BE LOADED INTO. 

* WHEN THE ZERO BLOCK SIZE IS READ, THE PROGRAM BLOCK 

* LOADING IS TERMINATED. THE PC WILL BRANCH TO THE 

* STARTING ADDRESS OF THE FIRST PROGRAM BLOCK. 
* 

* 3. IF SERIAL PORT 0 IS SELECTED FOR BOOT LOADING, THE 

* PROCESSOR WILL WAIT FOR THE INTERRUPT FROM THE 

* RECEIVE SERIAL PORT 0 AND PERFORM THE DOWNLOAD. 

* AS WITH THE EPROM LOADER, PROGRAMS CAN BE LOADED 

* INTO DIFFERENT MEMORY BLOCKS. THE FIRST TWO WORDS OF EACH 

* PROGRAM BLOCK CONTAIN THE BLOCK SIZE AND MEMORY ADDRESS 

* TO BE LOADED INTO. WHEN THE ZERO BLOCK SIZE IS READ, 

* PROGRAM BLOCK LOADING IS TERMINATED. IN OTHER WORDS, 

* IN ORDER TO TERMINATE THE PROGRAM BLOCK LOADING, 

* A ZERO HAS TO BE ADDED AT THE END OF THE PROGRAM BLOCK. 

* AFTER THE BOOT LOADING IS COMPLETED, THE PC WILL BRANCH 

* TO THE STARTING ADDRESS OF THE FIRST PROGRAM BLOCK. 
* 

************************************* 
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.global check 




• sect 


"vectors" 


reset 


.word 


check 


into 


• word 


809FClh 


intl 


• word 


809FC2h 


int2 


• word 


809FC3h 


int3 


• word 


809FC4h 


xintO 


• word 


809FC5h 


rintO 


.word 


809FC6h 




• word 


809FC7h 




.word 


809FC8h 


tintO 


• word 


809FC9h 


tintl 


• word 


809FCAh 


dint 


.word 


809FCBh 




• word 


809FCCh 




.word 


809FCDh 




• word 


809FCEh 




• word 


809FCFh 




• word 


809FD0h 




• word 


809FDlh 




• word 


809FD2h 




• word 


809FD3h 




• word 


809FD4h 




• word 


809FD5h 




• word 


809FD6h 




• word 


809FD7h 




• word 


809FD8h 




• word 


809FD9h 




• word 


809FDAh 




• word 


809FDBh 




• word 


809FDCh 




• word 


809FDDh 




• word 


809FDEh 




• word 


809FDFh 



******************************* 

trapO .word 809FE0h 

trapl .word 809FElh 

trap2 .word 809FE2h 

trap3 .word 809FE3h 

trap4 .word 809FE4h 

trap5 .word 809FE5h 

trap6 .word 809FE6h 

trap7 .word 809FE7h 

trap8 .word 809FE8h 

trap9 .word 809FE9h 

trap 10 .word 809FEAh 
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trap 11 
trap 12 
trapl3 
trap 14 
trapl5 
trap 16 
trap 17 
trap 18 
trapl9 
trap20 
trap21 
trap22 
trap23 
trap24 
trap25 
trap26 
trap27 



•word 
•word 
•word 
•word 
•word 
•word 
•word 
•word 
•word 
•word 
•word 
•word 
•word 
•word 
•word 
•word 
•word 
•word 

• WW J. \A 

•word 
•word 



809FEBh 
809FECh 
809FEDh 
809FEEh 
809FEFh 
809FF0h 
809FFlh 
809FF2h 
809FF3h 
809FF4h 
809FF5h 
809FF6h 
809FF7h 
809FF8h 
809FF9h 
809FFAh 
809FFBh 
809FFCh 

1 Ull 

809FFEh 
809FFFh 



•space 5 

check: 



intloop 



eprom_load 



LDI 


4040h,AR0 j 


load peripheral mem. map 


LSH 


9,AR0 ; 


start addr. 808000h 


LDI 


404Ch,SP j 


initialize stack pointer to 


LSH 


9/SP j 


ramO addr. 809800h 


LDI 


0,R0 j 


set start address flag off 


TSTB 


8, IF t 


test for ext int3 


BNZ 


serial , 


on int3 go to serial 


LDI 


8, AR1 < 


load OOlOOOh / 2 A 9 -> ARl 


TSTB 


1 , IF , 


test for into 


BNZ 


eproinJLoad , 


branch to eprom_load if into = 1 


LDI 


2000h,ARl 


load 400000h / 2 A 9 -> ARl 


TSTB 


2, IF 


test for intl 


BNZ 


eprom__load t 


branch to eprom__load if intl = 1 


LDI 


7FF8h,ARl 


load FFFOOOh / 2 A 9 -> ARl 


TSTB 


4, IF 


test for int2 


BZ 


intloop 


» if no intX go to intloop 


LSH 


9 , ARl 


• eprom address = ARl * 2 A 9 


LDI 


*AR1++(1) ,R1 


' load eprom mem. width 


LDI 


sub_w f AR3 


; full-word size subroutine 






f address —> AR3 


LSH 


26 f Rl 


; test bit 5 of mem. width word 


BN 


loadO 


; if '1' start PGM loading 






) (32 bits width) 
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loadO 



load2 



serial 



loop_s 
sub s 



NOP 


*AR1++ ( 1 ) 


• "iiiitir> last half word from mem. word 


LDI 


sub_Ji,AR3 


; half word size subroutine 






; address — > AR3 


LSH 


1 ,RI 


; test bit 4 of mem. width word 


BN 


loadO 


; if '1' start PGM loading 






; (16 bits width) 


LDI 


SUD__D, ARj 


; byte size subroutine address — > AR3 


ADD I 


2,ARl 


; jump last 2 bytes from mem, word 


CALLU 


AR3 


; load new word 






; according to mem. width 


STI 


Rl,*+AR0(64h) 


; set primary bus control 


CALLU 


AR3 


• load new word accordincf to 






; mem. width 


LDI 


Rl,RC 


; set block size for repeat loop 


CMP I 


0,RC 


; if 0 block size start PGM 


BZ 


AR2 




SUBI 


1,RC 


; block size —1 


CALLU 


AR3 


; load new word according to 






, mem. width 


LDI 


R1,AR4 


; set destination address 


LDI 


R0,R0 


; test start address loaded flag 


LDIZ 


Rl,AR2 


; load start address if flag off 


LDI 


-1,R0 


; set start & dest. address flag on 


SUBI 


1,AR3 


; sub address with loop 




CALLUAR3 


; load new word according to 






; mem. width 


T T\T 
LtVl 




; set dest. address flag off 


ADDI 


1,AR3 


; sub address without loop 


BR 


load2 


; jump to load a new block 






} when loop completed 


. space 


1 




LDI 


sub_s,AR3 


; serial words subroutine 






; address — > AR3 


LDI 


lllh,Rl 


; Rl = OOOOlllh 


STI 


Rl, *+AR0 (43h) 


; set CLKR,DR,FSR as serial port pins 


LDI 


0A30h / R2 




t eu 
Li oil 


1 C DO 

ID / RZ 


; Kz — Aouuuuun 


STI 


R2,*+AR0(40h) 


; set serial port global 






) Ctrl, register 


BR 


load2 


; jump to load 1st block 


• space 


29 




RPTB 


load_s 


; PGM load loop 


TSTB 


20h / IF 




BZ 


sub__s 


; wait for receive buffer full 


AND 


OFDFh, IF 


; reset interrupt flag 
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LDI 


*+AR0(4Ch) ,R1 










LDI 


R0,R0 


? 


test load address flag 






BNN 


end__s 








load_s 


STI 


Rl, *AR4++( 1 ) 


f 


store new word to dest. 


address 


end_s 


RETSU 




f 


return from subroutine 






o space 


2.1 








loop h 


RPTB 


load hi 




PCJM 1 ftaH 1 onn 








* 2VP 1 4-4. ( 1 \ Pi 


# 


1 naH T.QR V»a1 "f Ta7<"^y*<"? 






awn 


U£ mil; I\ J. 










LDI 


* AR 1 ++ f 1 \ P9 




1 03H MQR ha 1 "F XaTO>"H 
XwhU rlOD IICIXX. WUJ.U 






LSH 


16, R2 










OR 


R2,Rl 


} 


Rl ■ a new 3 2 -bit word 






LDI 


R0,R0 




test load address flag 






BNN 


end_h 








load_h 


STI 


R1,*AR4++(1) 


} 


store new word to dest. 


address 


end_h 


RETSU 




} 


return from subroutine 






• space 


ZD 








loop_w 


RPTB 


load_w 


} 


PGM load loop 




sub__w 


LDI 


*AR1++(1) ,R1 


1 


read a new 3 2 -bit word 






LDI 


R0,R0 


} 


test load address flag 






BNN 


end_w 








load_w 


STI 


Rl,*AR4++(l) 


} 


store new word to dest. 


address 


end_w 


RETSU 




} 


return from subroutine 






• space 


1 A 








loop b 


RPTB 


load b 


} 


PGM load Iood 




sub b 


LDI 


*ARl++f 1LR1 

mT*Jk\ jl 1 I l JL J J 4\ jl 










AND 


OFFh.Rl 

w JL JL 4 A a * \ JL 




load 1st bvte ( LSB \ 

XV/UU JL 0 W JLV V Lj»w 1 JLJ LJ» JLJ 1 






LDI 


*AR1++ (1\ .R2 










AND 


0FFh-R2 

W JL JL A JL f A\ m 










LSH 


8,R2 










OR 


R2 ,Rl 


. 

1 


load 2nd bvte 






LDI 


*ARl++f 1 ) ,R2 










AND 


0FFh,R2 










LSH 


16, R2 










OR 


R2 , Rl 


} 


load 3rd bvte 






LDI 


*ARl++f 1 * .R2 

JEjULV JL 1 1 I JL 1 f 1\ 4* 


1 


load 4th bvte ( MSB I 

X wviVi * I* JL JL W J WW I k 4.LJ +J I 








LSH 24, R2 










OR 


R2,R1 


} 


Rl = a new 32-bit word 






LDI 


R0,R0 


} 


test load address flag 






BNN 


end__b 








load_b 


STI 


R1,*AR4++(1) 


1 


store new word to dest. 


address 


end_b 


RETSU 




} 


return from subroutine 





• space 1 

• end 
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ASH instruction 1 0-50 to 1 0-51 
ASH3 and STI instructions (parallel) 

10-54 to 10-55 
ASH3 instruction 1 0-52 to 1 0-53 
Bcond instruction 1 0-56 to 1 0-57 
BcondD instruction 1 0-58 to 1 0-59 
bitwise exclusive-OR 1 0-206 

3-operand instruction 10-207 to 10-208 
bitwise logical-AND 1 0-42 

3-operand instruction 10-43 to 10-44 
bitwise logical-AND with complement 1 0-47 

3-operand instruction 10-48 to 10-49 
bitwise logical-complement 1 0-1 48 
bitwise logical-OR 10-151 

3-operand instruction 10-152 to 10-153 
BR instruction 10-60 

branch conditionally (delayed) 1 0-58 to 1 0-59 
branch conditionally (standard) 1 0-56 to 1 0-57 
branch unconditionally (delayed) 1 0-61 
branch unconditionally (standard) 10-60 
BRD instruction 10-61 
CALL instruction 1 0-62 
call subroutine 10-62 
call subroutine conditionally 1 0-63 to 1 0-64 
CALLconcf instruction 1 0-63 to 1 0-64 
categories 
ii legal 10-9 

interlocked operation 10-6 

load and store 10-2 

low-power control 1 0-5 

parallel operation 10-7 to 10-8 

program control 10-5 

three-operand 10-4 

two-operand 10-3 
CMPF instruction 1 0-65 
CMPF3 instruction 1 0-66 to 1 0-67 
CMPI instruction 10-68 
CMPI3 instruction 1 0-69 to 1 0-70 
compare floating-point 1 0-65 

3-operand instruction 10-66 to 10-67 
compare integer 1 0-68 

3-operand instruction 10-69 to 10-70 
condition codes 1 0-1 0 to 1 0-1 3 
condition for execution 1 0-1 0 to 1 0-1 3 
DBcond instruction 1 0-71 to 1 0-72 



assembly language instructions (continued) 
DBcondD instruction 1 0-73 to 1 0-74 
decrement and branch conditionally 

delayed 10-73 to 10-74 

standard 10-71 to 10-72 
example instruction 1 0-1 9 to 1 0-21 
FIX and STI instructions (parallel) 

10-77 to 10-78 
FIX instruction 1 0-75 to 1 0-76 
FLOAT and STF instructions (parallel) 

10-80 to 10-81 
FLOAT instruction 1 0-79 
floating-point-to-integer conversion 

10-75 to 10-76 
IACK instruction 1 0-82 
IDLE instruction 10-83 
idle until interrupt 1 0-83 
IDLE2 instruction 1 0-84 to 1 0-85 
individual instructions 1 0-1 4 to 1 0-21 0 
integer to floating-point conversion 1 0-79 
interrupt acknowledge 1 0-82 
LDE instruction 10-86 
LDF and LDF instructions (parallel) 

10-91 to 10-92 
LDF and STF instructions (parallel) 

10-93 to 10-94 
LDF instruction 10-87 
LDFcond instruction 1 0-88 to 1 0-89 
LDFI instruction 10-90 
LDI and LDI instructions (parallel) 

10-1 00 to 10-101 
LDI and STI instructions (parallel) 

10-102 to 10-103 
LDI instruction 1 0-95 to 1 0-96 
LDI cond instruction 1 0-97 to 1 0-98 
LDI I instruction 10-99 
LDM instruction 1 0-1 04 
LDP instruction 10-105 
load data page pointer 1 0-1 05 
load floating-point 1 0-87 

interlocked 10-90 
load floating-point conditionally 1 0-88 to 1 0-89 
load floating-point exponent 1 0-86 
load floating-point mantissa 1 0-1 04 
load integer 1 0-95 to 1 0-96 

interlocked 10-99 
load integer conditionally 1 0-97 to 1 0-98 
logical shift 1 0-1 07 to 1 0-1 08 

3-operand instruction 10-109 to 10-111 
LOPOWER instruction 1 0-1 06 
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assembly language instructions (continued) 
low-power idle 1 0-84 to 1 0-85 
LSH instruction 1 0-1 07 to 1 0-1 08 
LSH3 and STI instructions (parallel) 

10-112 to 10-114 
LSH3 instruction 1 0-1 09 to 10-111 
MAXSPEED instruction 10-115 
MPYF instruction 10-116 
MPYF3 and ADDF3 instructions (parallel) 

10-119 to 10-121 
MPYF3 and STF instructions (parallel) 

10-1 22 to 10-123 
MPYF3 and SUBF3 instructions (parallel) 

10-124 to 10-126 
MPYF3 instruction 1 0-1 1 7 to 1 0-1 1 8 
MPYI instruction 10-127 
MPYI3 and ADDI3 instructions (parallel) 

10-1 30 to 10-132 
MPYI3 and STI instructions (parallel) 

10-133 to 10-134 
MPYI3 and SUBI3 instructions (parallel) 

10-135 to 10-137 
MPYI3 instruction 1 0-1 28 to 1 0-1 29 
multiply floating-point 10-116 

3-operand instruction 10-117 to 10-118 
multiply integer 3-operand instruction 

10-128 to 10-129 
multiply integer instruction 1 0-1 27 
negative floating-point 1 0-1 39 
negative integer 1 0-1 42 
negative integer with borrow 1 0-1 38 
NEGB instruction 1 0-1 38 
NEGF and STF instructions (parallel) 

10-1 40 to 10-141 
NEGF instruction 1 0-1 39 
NEGI and STI instructions (parallel) 

10-143 to 10-144 
NEGI instruction 10-142 
no operation 10-145 
NOP instruction 1 0-1 45 
NORM instruction 1 0-1 46 to 1 0-1 47 
normalize 1 0-1 46 to 1 0-1 47 
NOT and STI instructions (parallel) 

10-1 49 to 10-150 
NOT instruction 1 0-1 48 
OR instruction 10-151 
OR3 and STI instructions (parallel) 

10-1 54 to 10-155 
OR3 instruction 1 0-1 52 to 1 0-1 53 



assembly language instructions (continued) 
parallel ABSF and STF instructions 

10-23 to 10-24 
parallel ABSI and STI instructions 

10-27 to 10-28 
parallel ADDF3 and MPYF3 instructions 

10-119 to 10-121 
parallel ADDF3 and STF instructions 

10-35 to 10-36 
parallel ADDI3 and MPYI3 instructions 

10-1 30 to 10-132 
parallel ADDI3 and STI instructions 

10-40 to 10-41 
parallel AND3 and STI instructions 

10-45 to 10-46 
parallel ASH3 and STI instructions 

10-54 to 10-55 
parallel FIX and STI instructions 1 0-77 to 1 0-78 
parallel FLOAT and STF instructions 

10-80 to 10-81 
parallel instructions advantages 1 1 -1 32 
parallel LDF and LDF instructions 

10-91 to 10-92 
parallel LDF and STF instructions 

10-93 to 10-94 
parallel LDI and LDI instructions 

10-1 00 to 10-101 
parallel LDI and STI instructions 

10-1 02 to 10-103 
parallel LSH3 and STI instructions 

10-112 to 10-114 
parallel MPYF3 and ADDF3 instructions 

10-119 to 10-121 
parallel MPYF3 and STF instructions 

10-1 22 to 10-123 
parallel MPYF3 and SUBF3 instructions 

10-1 24 to 10-126 
parallel MPYI3 and ADDI3 instructions 

10-1 30 to 10-132 
parallel MPYI3 and STI instructions 

10-1 33 to 10-134 
parallel MPYI3 and SUBI3 instructions 

10-1 35 to 10-137 
parallel NEGF and STF instructions 

10-1 40 to 10-141 
parallel NEGI and STI instructions 

10-1 43 to 10-144 
parallel NOT and STI instructions 

10-1 49 to 10-150 
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assembly language instructions (continued) 
parallel OR3 and STI 

instructions 1 0-1 54 to 1 0-1 55 
parallel STF and ABSF instructions 

10-23 to 10-24 
parallel STF and ADDF3 instructions 

10-35 to 10-36 
parallel STF and FLOAT instructions 

10-80 to 10-81 
parallel STF and LDF instructions 

10-93 to 10-94 
parallel STF and MPYF3 instructions 

10-122 to 10-123 
parallel STF and NEGF instructions 

10-140 to 10-141 
parallel STF and STF instructions 

1 0-1 76 to 1 0-1 77, 1 0-1 80 to 1 0-1 81 
parallel STF and SUBF3 instructions 

10-1 90 to 10-191 
parallel STI and ABSI instructions 

10-27 to 10-28 
parallel STI and ADDI3 instructions 

10-40 to 10-41 
parallel STI and AND3 instructions 

10-45 to 10-46 
parallel STI and ASH3 instructions 

10-54 to 10-55 
parallel STI and FIX instructions 1 0-77 to 1 0-78 
parallel STI and LDI instructions 

10-102 to 10-103 
parallel STI and LSH3 instructions 

10-112 to 10-114 
parallel STI and MPYI3 instructions 

10-133 to 10-134 
parallel STI and NEGI instructions 

10-143 to 10-144 
parallel STI and NOT instructions 

10-1 49 to 10-150 
parallel STI and OR3 instructions 

10-1 54 to 10-155 
parallel STI and SUBI3 instructions 

10-1 95 to 10-196 
parallel STI and XOR3 instructions 

10-209 to 10-210 
parallel SUBF3 and MPYF3 instructions 

10-1 24 to 10-126 
parallel SUBF3 and STF instructions 

10-190 to 10-191 
parallel SUBI3 and MPYI3 instructions 

10-135 to 10-137 



assembly language instructions (continued) 
parallel SUBI3 and STI instructions 

10-1 95 to 10-196 
parallel XOR3 and STI instructions 

10-209 to 10-210 
POP floating-point 1 0-1 57 
POP integer instruction 1 0-1 56 
POPF instruction 1 0-1 57 
PUSH floating-point 1 0-1 59 
PUSH integer instruction 1 0-1 58 
PUSHF instruction 1 0-1 59 
register syntax 10-18 
repeat block 10-170 
repeat single 1 0-1 71 to 1 0-1 72 
restore clock to regular speed 10-115 
RETI cond instruction 1 0-1 60 to 1 0-1 61 
return from subroutine conditionally 1 0-1 62 
RETScond instruction 1 0-1 62 
return from interrupt conditionally 

10-1 60 to 10-161 
RND instruction 1 0-1 63 to 1 0-1 64 
ROL instruction 1 0-1 65 
ROLC instruction 1 0-1 66 to 1 0-1 67 
ROR instruction 10-168 
RORC instruction 1 0-1 69 
rotate 

left 10-165 

left through carry 10-166 to 10-167 
right 10-168 

right through carry 10-169 
round floating-point 1 0-1 63 to 1 0-1 64 
RPTB instruction 1 0-1 70 
RPTS instruction 1 0-1 71 to 1 0-1 72 
SIGI instruction 10-173 
signal, interlocked 1 0-1 73 
software interrupt 1 0-200 
STF and ABSF instructions (parallel) 

10-23 to 10-24 
STF and ADDF3 instructions (parallel) 

10-35 to 10-36 
STF and FLOAT instructions (parallel) 

10-80 to 10-81 
STF and LDF instructions (parallel) 

10-93 to 10-94 
STF and MPYF3 instructions (parallel) 

10-122 to 10-123 
STF and NEGF instructions (parallel) 

10-1 40 to 10-141 
STF and STF instructions (parallel) 

10-176 to 10-177 
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assembly language instructions (continued) 
STF and SUBF3 instructions (parallel) 

10-1 90 to 10-191 
STF instruction 10-174 
STFI instruction 1 0-1 75 
STI and ABSI instructions (parallel) 

10-27 to 10-28 
STI and ADDI3 instructions (parallel) 

10-40 to 10-41 
STI and AND3 instructions (parallel) 

10-45 to 10-46 
STI and ASH3 instructions (parallel) 

10-54 to 10-55 
STI and FIX instructions (parallel) 

10-77 to 10-78 
STI and LDI instructions (parallel) 

10-102 to 10-103 
STI and LSH3 instructions (parallel) 

10-112 to 10-114 
STI and MPYI3 instructions (parallel) 

10-1 33 to 10-134 
STI and NEGI instructions (parallel) 

10-143 to 10-144 
STI and NOT instructions (parallel) 

10-149 to 10-150 
STI and OR3 instructions (parallel) 

10-154 to 10-155 
STI and STI instructions (parallel) 

10-1 80 to 10-181 
STI and SUBI3 instructions (parallel) 

10-195 to 10-196 
STI and XOR3 instructions (parallel) 

10-209 to 10-210 
STI instruction 10-178 
STII instruction 10-179 
store floating-point 1 0-1 74 
store floating-point, interlocked 1 0-1 75 
store integer 10-178 
store integer, interlocked 1 0-1 79 
SUBB instruction 1 0-1 82 
SUBB3 instruction 1 0-1 83 to 1 0-1 84 
SUBC instruction 1 0-1 85 to 1 0-1 86 

integer division 11-27 to 11-30 
SUBF instruction 1 0-1 87 
SUBF3 and MPYF3 instructions (parallel) 

10-1 24 to 10-126 
SUBF3 and STF instructions (parallel) 

10-1 90 to 10-191 
SUBF3 instruction 1 0-1 88 to 1 0-1 89 
SUBI instruction 10-192 



assembly language instructions (continued) 
SUBI3 and MPYI3 instructions (parallel) 

10-1 35 to 10-137 
SUBI3 and STI instructions (parallel) 

10-1 95 to 10-196 
SUBI3 instruction 1 0-1 93 to 1 0-1 94 
SUBRB instruction 1 0-1 97 
SUBRF instruction 1 0-1 98 
SUBRI instruction 1 0-1 99 
subtract floating-point 1 0-1 87 

3-operand instruction 10-188 to 10-189 
subtract integer 1 0-1 92 

3-operand instruction 10-193 to 10-194 
subtract integer conditionally 1 0-1 85 to 1 0-1 86 
subtract integer with borrow 1 0-1 82 

3-operand instruction 10-183 to 10-184 
subtract reverse floating-point 1 0-1 98 

ouuu aui i ovoi so n iieCjoi i u- i 

subtract reverse integer with borrow 1 0-1 97 

SWI instruction 10-200 

symbols used to define 1 0-1 5 to 1 0-1 8 

syntax options 1 0-1 6 to 1 0-1 8 

test bit fields 10-203 

3-operand instruction 10-204 to 10-205 

trap conditionally 1 0-201 to 1 0-202 

TRAP cond instruction 1 0-201 to 1 0-202 

TSTB instruction 1 0-203 

TSTB3 instruction 1 0-204 to 1 0-205 

XOR instruction 1 0-206 

XOR3 and STI instructions (parallel) 
10-209 to 10-210 

XOR3 instruction 1 0-207 to 1 0-208 
auxiliary (AR0-AR7) registers 3-3 
auxiliary register ALUs 2-6 
auxiliary register arithmetic units (ARAUs) 5-5 



B 



bank switching 

external bus 1 2-1 3 to 1 2-1 8 

programmable 7-30 to 7-32 
bank switching techniques 1 2-1 3 to 1 2-1 9 
Bcond instruction 1 0-56 to 1 0-57 
BcondD instruction 1 0-58 to 1 0-59 
biquad 11-60 

bit manipulation 1 1 -23 to 1 1 -24 

bit-reversed addressing 5-29 to 5-30, 1 1 -25 
FFT algorithms 5-29 to 5-30 
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bitwise exclusive-OR instruction 1 0-206 
3-operand instruction 1 0-207 

bitwise logical-complement instruction 1 0-1 48 

bitwise logical-AND instruction 10-42 

3-operand instruction 1 0-43 
bitwise logical-ANDN instruction 1 0-47 

3-operand instruction 10-48 

bitwise logical-OR instruction 1 0-1 51 
3-operand instruction 1 0-1 52 

block 

moves 11-25 
repeat 11-18 
repeat modes 6-2 to 6-7 

control bits 6-3 

nested block repeats 6-7 

operation 6-3 to 6-4 

RC register value 6-6 to 6-7 

restrictions 6-6 

RPTB instruction 6-4 to 6-5 

RPTS instruction 6-5 
repeat registers (RC, RE, RS) 3-11 , 6-2 
size (BK) register 3-4 

block diagram 

architectural 2-3 

functional 1-5 
boot loader 3-26 

external memory loading 3-30 

interrupt and trap vector mapping 3-33 

invoking 3-26 

mode selection 3-29 

operations 3-26 

precautions 3-35 

serial-port loading 3-33 
boot loader source code G-1 to G-6 
BR instruction 10-60 
branch conflicts 9-4 to 9-6 

branch unconditionally (delayed) instruction 
10-58, 10-61 

branch unconditionally (standard) instruction 

10-56,10-60 
branches 6-8 

delayed 6-8 to 6-9, 11-17 

BRD instruction 10-61 
breakdown of numbers B-9 to B-1 0 
buffered signals 1 2-43 

MPSD 12-42 
buffering 12-41 



bulletin board service (BBS) B-5 to B-6 
bus operation 7-1 to 7-32 

external 2-26 

internal 2-22 
buses 

DMA 2-22 

program 2-22 
busy-waiting example 6-14 
byte-wide configured memory 3-31 



C 



C (HLL) routines 1 1 -1 31 to 1 1 -1 34 

C compiler B-2 

'C30, memory maps 2-1 4 

'C30 power dissipation D-1 to D-32 

FFT assembly code D-30 to D-32 

photo of l DD for FFT D-29 

summary D-28 
•C31 

memory maps 2-15 

interrupt and trap memory maps 3-34 

reserved memory locations 2-31 

'C3xDSPs 1-2 

cache 

architecture 3-21 to 3-23 
control bits 3-24 

cache clear bit (CC) 3-24 
cache enable bit (CE) 3-24 
cache freeze bit (CF) 3-25 
hit 3-23 
instruction 2-12 
memory 2-11,3-21 
algorithm 3-23 to 3-24 
architecture 3-21 
instruction 3-21 
miss 3-23 
segment 3-24 
word 3-23 
CALL instruction 6-1 0, 1 0-62 
call subroutine conditionally instruction 1 0-63 
call subroutine instruction 1 0-62 
CALLcond instruction 6-1 0, 1 0-63 to 1 0-64 
calls 6-10 to 6-11 
carry flag 10-12 
cautions x 

C-callable routines 11-131 
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central processing unit 2-4 
block diagram 2-5 
registers 2-8 

circular addressing 5-24 to 5-28 
algorithm 5-26 
circular buffer 5-24 
FIR filters 5-28,11-58 
operation 5-27 

clkout 8-21,8-22 

CLKR pins 8-20 

CLKXpins 8-19 

clock mode 

timer interrupt 8-11 

timer pulse generator 8-8 to 8-9 

clock oscillator circuitry 1 2-27 to 1 2-29 

clocking of memory accesses 9-23 to 9-30 
data loads and stores 9-24 to 9-30 
program fetches 9-23 

CMPF instruction 1 0-65 

CMPF3 instruction 1 0-66 to 1 0-67 

CMPI instruction 10-68 

CMPI3 instruction 1 0-69 to 1 0-70 

COMBO F-6 

companding 1 1 -53 to 1 1 -57 

compare floating-point instruction 1 0-65 
3-operand instruction 10-66 

compare integer instruction 1 0-68 
3-operand instruction 1 0-69 

compiler B-2 

compression 
A-law 11-56 
U-law 11-54 

computed GOTO 11-22 

condition codes and flags 1 0-1 0 to 1 0-1 3 

condition flags 1 0-1 0 to 1 0-1 3 
floating-point underflow 1 0-1 1 
latched floating-point underflow 1 0-1 1 
latched overflow 1 0-1 1 
negative 10-11 
overflow 10-12 
zero 10-11 

conditional-branch addressing modes 2-1 6, 5-23 

conditional delayed branches 6-8 
compare instructions 6-8 
extended-precision registers 6-8 



connector 

dimensions, mechanical 1 2-43 to 1 2-45 
12-pin header 12-39 

consumer electronics F-20 to F-24 

context switching 1 1 -1 1 to 1 1 -1 5 

context restore for 'C3x 1 1 -1 4 to 1 1 -1 6 
context save for 'C3x 1 1 -1 2 to 1 1 -1 3 

control registers, external interface 7-2 to 7-5 
expansion bus 7-5 to 7-6 
primary bus 7-3 to 7-4 

conversion 

floating-point to integer 4-22 to 4-23 
integer to floating-point 4-24 
time to frequency domain (FFTs) 
11-73 to 11-125 

counter 

example 6-14 
register (timer) 8-3, 8-8 

CPU 2-4 to 2-10 
block diagram 2-5 
general 2-4 
interrupt 

DMA interaction 6-30 

latency 6-30 

processing cycle 6-29 
interrupt flag register (IF) 3-9 
register file 2-7, 3-2 to 3-1 2 
registers 2-7 to 2-1 0, 3-2 to 3-1 2 

auxiliary (AR0-AR7) 2-8, 3-3 

block repeat (RS, RE) 3-11 

block size (BK) 2-9,3-4 

CPU/DMA interrupt enable (IE) 3-7 

data-page pointer (DP) 2-9, 3-4 

extended precision (R0-R7) 2-8, 3-3 

I/O flag (IOF) 2-9,3-10 

index (IR1JR0) 2-9,3-4 

interrupt enable (IE) 2-9, 3-7 

interrupt flag (IF) 2-9, 3-9 

list of 3-2 

program counter (PC) 2- 10, 2-22, 3- 1 1 
repeat count (RC) 2-10, 3- 1 1, 6-2 
repeat end address (RE) 2-10, 3-11, 6-2 
repeat start address (RS) 2-10, 3-11, 6-2 
reserved bits 3-12 
status register (ST) 2-9, 3-4, 10-11 
system stack pointer (SP) 2-9, 3-4 
transfer, with serial-port transmit polling 
8-38 to 8-39 
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current calculations D-26 to D-27 
average D-27 
data output D-26 to D-27 
processing D-26 



D 



D/A converter interface 1 2-23 to 1 2-26 
D/A input/output system 1 2-32 to 1 2-35 
DAC F-23 
data 

converters F-17 

loads and stores 9-24 to 9-29 

operations with parallel stores 9-27 to 9-29 
parallel multiplies and adds 9-29 
three-operand instructions 9-24 to 9-27 
two-operand instructions 9-24 
data formats 4-1 to 4-24 

floating-point formats 4-4 to 4-9 

conversion between formats 4-8 to 4-9 
extended-precision 4-6 to 4-7 
short 4-4 to 4-5 
single-precision 4-6 
floating-point to integer conversion 4-22 to 4-23 
floating-point addition and subtraction 

4-1 4 to 4-1 7 
floating-point multiplication 4-1 0 to 4-1 3 
integer formats 4-2 
short 4-2 

single-precision 4-2 to 4-3 
integer to floating-point conversion 4-24 
normalization using NORM 4-1 8 to 4-1 9 
rounding with RND 4-20 to 4-21 
unsigned-integer formats 4-3 

short 4-3 

single-precision 4-3 to 4-4 
data-page pointer (DP) register 2-9, 3-4 

data-rate timing operation 
fixed 8-30 

burst mode 8-30 

continuous mode 8-30 
variable 8-34 

burst mode 8-34 

continuous mode 8-35 
data-receive register 8-24 
data-transmit register 8-23, 8-27, 8-30, 8-32 
DBcond instruction 1 0-71 to 1 0-72 
DBcondD instruction 1 0-73 to 1 0-74 



debugger B-3 
decode unit 9-2 

decrement and branch conditionally (delayed) 

instruction 10-73 
decrement and branch conditionally (standard) 

instruction 10-71 
delayed branches 6-8 to 6-9, 11-17 
advantages 11-132 
conditional 6-8 
incorrectly placed 6-6 
dependencies D-2 to D-3 
dequeue (stacks) 5-31 , 5-33 
development support B-1 to B-1 0 
tools B-2 to B-6 

bulletin board service B-5 to B-6 
code generation tools B-2 
assembler/linker B-2 
C compiler B-2 
compiler B-2 
linker B-2 
digital filter design package B-2 
documentation B-5 
hotline B-5 
literature B-5 
seminars B-6 
system integration and debug 
tools B-3 to B-4 
debugger B-3 

emulation porting kit (EPK) B-4 to B-5 
emulator B-3 

evaluation module (EVM) B-3 
simulator B-3 
XDS510 emulator B-3 
technical training organization (TTO) work- 
shop B-6 
third parties B-4 
workshops B-6 
device suffixes B-9 to B-1 0 
diagnostic applications 1 2-45 to 1 2-46 
digital audio F-21 
digital electronics F-20 to F-24 
digital filter design package B-2 
dimensions, 1 2-pin emulator connector 

12-43 to 12-45 
direct 

addressing 5-4 
memory access 2-29 
disabled interrupts by branch 6-8 
displacements 5-5 
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dissipation, power D-1 to D-32 
algorithm partitioning D-4 
dependencies D-2 to D-3 
FFT assembly code D-30 to D-32 
photo of IDD for FFT D-29 
power requirements D-2 
power supply current requirements D-2 
test setup description D-4 to D-5 

divide clock by 1 6 instruction 1 0-1 06 

division 11 -26 to 11-33 

floating-point 1 1 -31 to 1 1 -33 

DMA 

architecture 2-29 
block moves 8-43, 1 1 -25 
buses 2-22 
channel 9-2 

channel synchronization 8-54 to 8-56 
controller 2-22, 8-43 to 8-64 

block diagram 2-29 
destination register 8-49 to 8-53 
destination/source address register 8-47 
general 2-29 

initialization reconfiguration 8-57 
interrupt 8-56 

CPU Interaction 6-30 

processing cycle 6-29 
interrupt-enable register 8-47 to 8-49 
maximum transfer rates 8-53 
memory transfer 8-49 to 8-53 
memory-mapped registers 8-43 
programming hints 8-57 to 8-58 
setup and use examples 8-58 to 8-64 
source register 8-49 to 8-53 
synchronization of channels 8-54 to 8-56 
timing 

expansion bus destination 8-52 
on-chip destination 8-50 
primary bus destination 8-51 
transfer-counter register 8-47 

documentation v, vii, B-5 

DR pins 8-20 

dry pack C-7 

dummy fetch 9-4 

DXpins 8-19 



E 



electrical 

characteristics 

pinout and pin assignments 13-2 to 13- 15 

signal descriptions 13- 16 to 13-24 

signal transition levels 13-29 

summary D-28 
specifications 1 3-25 to 1 3-28 

emulation porting kit (EPK) B-4 to B-5 

emulator B-3 

connection to target system 1 2-41 to 1 2-43 

MPSD mechanical dimensions 
12-43 to 12-45 
connector, mechanical dimensions 

12-43 to 12-45 
MPSD connector, 12-pin header 

12-39 to 12-40 
pod interface 12-40 
signal buffering 12-41 

emulator cable, signal timing, MPSD 
12-40 to 12-41 

emulator pod 

MPSD timings 12-41 
parameters 12-41 

evaluation module (EVM) B-3 

event counters 8-2 

example circuit 1 2-1 3 to 1 2-46 

example instruction 1 0-1 9 to 1 0-21 

execute unit 9-2 

expansion 
A-law 11-57 

bus. See expansion buses and external buses 
U-law 11-55 

expansion buses 7-2 

functional timing of operations 7-6 
I/O cycles 7-11 to 7-32 
programmable wait states 7-28 to 7-29 

expansion bus control register 7-5 to 7-6 

expansion bus interface 1 2-1 9 to 1 2-26 
A/D converter 12-19 
D/A converter 12-23 
ready generation 1 2-9 to 1 2-1 3 
functions 12-11 
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extended-precision 

arithmetic 11 -38 to 11-41 
floating-point format 4-6 to 4-7 
addition example 1 1 -39 
multiplication example 1 1 -40 
subtract example 1 1 -39 

extended-precision (R7-R0) registers 3-3 

external 

buses (expansion, primary) 2-26, 7-1 
bank switching 12- 13 to 12-18 
expansion bus interface 12-19 to 12-26 
external interrupts 2-26 
interlocked instructions 2-26 
primary bus interface 12-4 to 12- 18 
ready generation 12-9 to 12- 13 
wait states 12-9 to 12- 13 

devices 12-3 

interfaces 12-2 

external bus operation 2-26, 7-1 to 7-32 

external interface control registers 7-2 to 7-5 
expansion bus 7-5 to 7-6 
primary bus 7-3 to 7-4 
external interface timing 
expansion bus 7-6 to 7-27 
expansion-bus I/O cycles 7-11 to 7-32 
primary-bus cycles 7-6 to 7-10 
programmable bank switching 7-30 to 7-32 
programmable wait states 7-28 to 7-29 

external interface, control registers 7-2 to 7-5 

external interface timing 7-6 to 7-27 
expansion bus I/O cycles 7-11 to 7-32 
primary bus cycles 7-6 to 7-1 0 

external interrupts 6-23 

external memory loader header 3-30 

external ready generation 1 2-1 0 to 1 2-1 1 

external reset signal 6-1 8 



F 



fast Fourier transforms (FFT) 1 1 -25, 
11-73 to 11-125, D-26 

fetch unit 9-2 

FFT 11-73 to 11-125 

FFT algorithms 5-29 

bit-reversed addressing 5-29 



filters 11 -58 to 11-67 
adaptive 11-67 
FIR 11 -58 to 11 -60 
IIR 11 -60 to 11-66 
lattice 11 -125 to 11-130 
LMS algorithm 11-67 

FIR filters 5-28, 1 1 -58 to 1 1 -60 

circular addressing 5-28, 1 1 -58 
FIX and STI instructions (parallel) 1 0-77 to 1 0-78 
FIX instruction 1 0-75 to 1 0-76 
fixed data-rate timing operation, timing 8-30 
burst mode 8-30 
continuous mode 8-30 
fixed point 1-4 
flag 

carry 10-12 
condition 

floating-point underflow 10-11 
latched floating-point underflow 10-11 
latched overflow 10-11 
negative 10-11 
overflow 10-12 
zero 10-11 
FLOAT and STF instructions 
(parallel) 10-80 to 10-81 
FLOAT instruction 4-24, 1 0-79 
floatingpoint 1-4 
addition 4-1 4 to 4-1 7 

examples 4-1 6 to 4-1 8 
conversion to integer 4-22 to 4-23 
division 11-26, 11-31 to 11-33 
format 4-4 to 4-9 

conversion 4-8 to 4-9, 11-44 to 11-48, 

11 -49 to 11-52 
extended-precision 4-6 to 4-7 
IEEE definition 11-43 
short 4-4 to 4-5 
single-precision 4-6 
TMS320C3X definition 11-42 to 11-44 
IEEE to TMS320, 1 1 -42 to 1 1 -52 
inverse 11-31 to 11-33 
multiplication 4-1 0 to 4-1 3 
examples 4- 12 to 4- 14 
flowchart 4-11 
normalization 4-1 8 to 4-1 9 
normalized 4-14 
operation 4-1 to 4-24 
rounding value 4-20 to 4-21 
square root 11-34 
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floating point (continued) 

subtraction 4-1 4 to 4-1 7 
examples 4-16 to 4- 18 

TMS320 to IEEE 11 -42 to 1 1 -52 

underflow 4-15 
floating-point-to-integer conversion instruction 

10-75 

floating-point underflow condition flag 1 0-1 1 

frame sync 8-32, 8-33 

FSR pins 8-20 

FSXpins 8-19 

functional block diagram 1 -5 



G 



genera! addressing modes 2-1 6, 5-1 9 to 5-20 
general-purpose applications 1 -4 
generation, TMS320C3x DSPs 1 -2 
global memory 6-1 2, 6-1 5 
global-control register 8-2 
DMA 8-47 

register bits 8-45 to 8-47 
serial port 8-1 3, 8-1 5 to 8-1 8 
bits summary 8- 15 to 8- 18 
timer 8-3 to 8-8 

register bits summary 8-4 to 8-6 
GOTO 11-22 



H 



hardware applications 1 2-1 to 1 2-46 
expansion bus interface 1 2-1 9 to 1 2-26 
AID converter 12-1 9 to 12-22 
D/A converter 12-23 to 12-27 
low-power mode interrupt interface 

12-36 to 12-38 
primary bus interface 1 2-4 to 1 2-1 8 

bank switching techniques 12- 13 to 12-19 
ready generation 12-9 to 12- 13 
zero-wait-state to static-RAMs 12-4 to 12-8 
serial-port interface 1 2-32 to 1 2-35 
system configuration options 1 2-2 to 1 2-3 
categories of interfaces 12-2 
typical block diagram 12-3 to 12-4 
system control functions 1 2-27 to 1 2-31 
clock oscillator circuitry 12-27 to 12-29 
reset signal generation 12-29 to 12-39 



hardware applications (continued) 
XDS target design 

considerations 1 2-39 to 1 2-46 
connections between emulator and target 

system 12-41 to 12-43 
diagnostic applications 12-45 to 12-46 
mechanical dimensions for emulator 

connector 12-43 to 12-45 
MPSD emulator cable signal timing 

12-40 to 12-41 
MPSD emulator connector 12-39 to 12-40 

hardware control 6-1 
hardware reset 11-2 
HDTV F-20 
header 

12-pin 12-39 

dimensions 

mechanical 12-43 to 12-45 
12-pin header 12-39 

signal descriptions, 1 2-pin header 1 2-39 

straight, unshrouded 12-39 

hints for assembly coding 1 1 -1 31 to 1 1 -1 32 
hotline B-5 




I/O flags register (IOF) 3-1 0 

IACK instruction 6-29, 1 0-82 

IDLE instruction 10-83 

IDLE2 power management mode 6-36 to 6-37 

IDLE2 instruction 1 0-84 to 1 0-85, 1 2-36 to 1 2-38 

IE register bits summary, CPU register file 3-8 

IF register bits summary, CPU register file 3-9 

I/O flag register (IOF), CPU register file 3-1 0 

MR filters 11 -60 to 11 -66 

illegal instructions 10-9 

index (IR0.IR1) register 3-4 

indirect addressing 5-5 to 5-1 6 
ARAUs 5-5 
auxiliary register 5-5 
parallel addressing mode 5-22 
three-operand addressing mode 5-21 
with postdisplacement 5-1 0 
with postindex 5-1 4 to 5-1 7 
with predisplacement 5-8 to 5-1 0 
with preindex 5-1 2 to 5-1 4 
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individual instructions 1 0-1 4 to 1 0-21 0 
example 10-1 9 to 10-21 
symbols and abbreviations 1 0-1 4 to 1 0-1 5 

initialization 
DMA 8-57 

processor 1 1 -2 to 1 1 -5 

input clock 12-27 

instruction 
cache 3-21 
memory 

three-operand reads 9-24 to 9-27 
two-operand accesses 9-24 

opcodes A-1 to A-6 

register (IR) 2-22 

instruction cache 2-1 2 

instruction set 1 0-22 to 1 0-21 0 
categories 10-2 

example instruction 1 0-1 9 to 1 0-21 
summary 

alphabetical 2-17 to 2-21 

function listing 10-2 to 10-9 

table 2-1 7 to 2-21 

instructions 

assembly language 1 0-1 to 1 0-1 8 
illegal 10-9 

interlocked operations 1 0-6 
load-and-store 10-2 
low-power control operations 1 0-5 
parallel operations 1 0-7 to 1 0-8 
program control 10-5 
three-operand 10-4 
two-operand 10-3 

INT0-INT3 signals 3-1 8, 3-1 9, 6-24 

integer 

division 1 1 -26, 1 1 -27 to 1 1 -30 
format 4-2 

short integer 4-2 

signed 4-2 

single-precision integer 4-2 
unsigned 4-3 

integer-to-floating-point conversion 4-24 
instruction 10-79 

interfaces 

expansion bus 2-26, 1 2-1 9 to 1 2-26 
A/D converter interface 12-19 to 12-22 
D/A converter 12-23 to 12-26 

low-power-mode interrupt 1 2-36 to 1 2-38 



interfaces (continued) 

primary bus 2-26, 1 2-4 to 1 2-1 8 
See also primary bus interface 
bank switching techniques 12- 13 to 12-19 
ready generation 12-9 to 12- 13 
zero-wait-state to static RAMs 12-4 to 12-8 

serial port 1 2-32 to 1 2-35 

system control, clock circuitry 1 2-27 to 1 2-29 

types 12-2 

interlocked operations 6-1 2 to 6-1 7 
busy-waiting loop 6-1 4 
external flag pins (XF0, XF1 ) 6-12 
instructions 6-13 
loads and stores 6-12 
multiprocessor counter 6-1 4 

interlocked operations instructions 1 0-6 

internal 

bus operation 2-22 
clock 8-10 

internal circuitry current requirement D-5 to D-8 
internal bus operations D-6 to D-9 
internal operations D-5 
quiescent D-5 

internal interrupts 6-23 

interrupt 6-23 to 6-35 

acknowledge instruction 1 0-82 
enable (IE) register 3-7 

bits summary 3-8 
flag (IF) register 3-9 

bits summary 3-9 

interrupts 2-26 

considerations ('C3x) 6-31 to 6-34 

context switching 1 1 -1 1 to 1 1 -1 5 

context restore for 'C3x 11-14 to 11-16 
context save for 'C3x 11 -12 to 11-13 

control bits 6-26 to 6-27 
global control register 6-27 
interrupt enable register (IE) 6-26 
interrupt flag register (IF) 6-26 
status register (ST) 6-26 

CPU/DMA interaction 6-30 

DMA 8-56 

flag register behavior 6-27 
latency (CPU) 6-29 to 6-30 
prioritization and control 

6-25 to 6-26, 6-34 to 6-35, 11-16 
processing 6-27 to 6-30 
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interrupts (continued) 
serial port 8-29 
receive timer 8-29 
receiver 8-29 
transmit timer 8-29 
transmitter 8-29 
service routines 11-9 

example 11-16 
timer 8-2,8-11 
vectors 3-1 8, 3-1 9, 6-35 
table 6-23 to 6-25 
inverse 11-31 to 11-33 
inverse lattice filter 1 1 -1 26 
IOF register bits summary, CPU register file 3-1 1 
IOSTRB signal 7-2, 7-6 



K 



key features 
'C30 1-6 
'C31 1-8 




latched floating-point overflow and underflow 

condition flags 10-11 
lattice filters 1 1 -1 25 to 1 1 -1 30 
LDE instruction 10-86 

LDF and LDF instructions (parallel) 10-91 to 10-92 

LDF and STF instructions (parallel) 1 0-93 to 1 0-94 

LDF instruction 10-87 

LDFcond instruction 1 0-88 to 1 0-89 

LDFI instruction 10-90 

LDI and LDI instructions (parallel) 

10-1 00 to 10-101 
LDI and STI instructions (parallel) 

10-1 02 to 10-103 
LDI instruction 1 0-95 to 1 0-96 
LDI cond instruction 1 0-97 to 1 0-98 
LDII instruction 10-99 
LDM instruction 1 0-1 04 
LDP instruction 10-105 
linker B-2 

literature vtoviii B-5 

LMS algorithm filters 1 1 -67 

load data page pointer instruction 1 0-1 05 



load floating-point conditional instruction 1 0-88 
load floating-point exponent instruction 1 0-86 
load floating-point mantissa instruction 1 0-1 04 
load floating-point interlocked instruction 1 0-87 
load integer conditionally instruction 1 0-97 
load integer instruction 1 0-95 
load integer, interlocked instruction 10-99 
load-and-store instructions 1 0-2 
loader mode selection 3-30 
logical operations 1 1 -23 to 1 1 -34 
bit manipulation 1 1 -23 to 1 1 -24 
bit-reversed addressing 1 1 -25 to 1 1 -26 
block moves 11-25 

extended-precision arithmetic 1 1 -38 to 1 1 -41 

floating-point format conversion 1 1 -42 to 1 1 -52 

integer and floating-point division 
11-26 to 11-33 

square root 11-34 
logical shift instruction 1 0-1 07 

3-operand instruction 1 0-1 09 
long-immediate addressing 2-1 6, 5-1 7 
looping 11 -18 to 11-21 

block repeat 1 1 -1 8 to 1 1 -20 

single-instruction repeat 1 1 -20 to 1 1 -26 
LOPOWER instruction 1 0-1 06 
LOPOWERmode 6-38 
low-power control instructions 1 0-5 
low-power idle instruction 1 0-84 
low-power-mode interrupt interface 1 2-36 to 1 2-38 
low-power-mode wakeup example 

11-133 to 11-134 
LRU cache update 3-21 
LSH instruction 1 0-1 07 to 1 0-1 08 
LSH3 and STI instructions (parallel) 

10-112 to 10-114 
LSH3 instruction 1 0-1 09 to 1 0-1 1 1 



M 



matrix-vector multiplication 1 1 -70 
MAXSPEED instruction 10-115 
memory 2-11 , 3-13, 3-21 

accesses (pipeline) clocking 9-23 to 9-29 

addressing modes 2-1 6 

cache 2-11,3-21,11-132 
See also cache 

DMA memory transfer 8-49 to 8-53 



Index- 14 



Index 



memory (continued) 

general organization 2-1 1 

global 6-12,6-15 

maps 2-13,3-13,3-17 
'C30 2-14,3-15 
'C31 2-15,3-16 

microcomputer mode 3-1 3 

microprocessor mode 3-1 3 

pipeline conflicts 9-1 0 to 9-1 7 
execute only 9- 13 to 9-15 
hold everything 9-1 5 to 9-1 7 
program fetch incomplete 9- 12 
program wait 9-10 to 9- 13 
resolving 9-21 to 9-22 

quick access 11-132 
memory addressing 

modes 2-16 

parallel multiplies and adds 9-29 

three-operand instructions 9-24 

two-operand instructions 9-24 
memory maps 

'C30 2-14,3-15 

'C31 2-15,3-16 
memory organization, block diagram 2-1 2 
microcomputer mode 2-1 3, 3-1 4, 3-1 7 
microcomputer/boot loader mode 3-1 7 
microprocessor mode 2-1 3, 3-1 3, 3-1 7 
modem applications F-1 7 to F-1 9 
MPSD emulator 

buffered transmission signals 1 2-42 

cable signal timing 1 2-40 to 1 2-41 

connector 1 2-39 to 1 2-40 

no signal buffering 1 2-41 
MPYF instruction 9-4, 10-116 
MPYF3 and ADDF3 instructions (parallel) 

10-119 to 10-121 
MPYF3 and STF instructions (parallel) 

10-1 22 to 10-123 
MPYF3 and SUBF3 instructions (parallel) 

10-1 24 to 10-126 
MPYF3 instruction 1 0-1 1 7 to 1 0-1 1 8 
MPYI instruction 10-127 
MPYI3 and ADDI3 instructions (parallel) 

10-1 30 to 10-132 
MPYI3 and STI instructions (parallel) 

10-1 33 to 10-134 

MPYI3 and SUBI3 instructions (parallel) 
10-1 35 to 10-137 



MPYI3 instruction 1 0-1 28 to 1 0-1 29 
MSTRB signal 7-2,7-6 
multimedia applications F-2 to F-4 

multimedia-related devices F-4 

system design considerations F-2 to F-3 
multiple processors 6-1 2 
multiplication 

floating-point 4-10 

examples 4- 12 to 4-1 4 
flowchart 4-11 

matrix-vector 1 1 -70 to 1 1 -73 
multiplier 2-6 

multiply floating-point instruction 10-116 

3-operand instruction 10-117 
multiply integer instruction 1 0-1 27 

3-operand instruction 1 0-1 28 
multiprocessor support 6-1 2 



N 



negative condition flag 1 0-1 1 

negative floating-point instruction 1 0-1 39 

negative integer instruction 1 0-1 42 

negative integer with borrow instruction 1 0-1 38 

NEGB instruction 10-138 

NEGF and STF instructions (parallel) 

10-1 40 to 10-141 
NEGF instruction 10-139 
NEGI and STI instructions (parallel) 

10-1 43 to 10-144 
NEGI instruction 10-142 
nested block repeats 6-7 
no operation instruction 1 0-1 45 
NOP instruction 10-145 

NORM instruction 4-1 8 to 4-1 9, 1 0-1 46 to 1 0-1 47 
normalization, floating-point value 4-1 4, 

4-1 8 to 4-1 9 
normalize instruction 1 0-1 46 
NOT and STI instructions (parallel) 

10-1 49 to 10-150 
NOT instruction 1 0-1 48 



O 



operations with parallel stores 9-27 to 9-29 
optional assembler syntax 1 0-1 6 to 1 0-1 8 
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options overview (system configuration) 1 2-2 

OR instruction 10-151 

OR3 and STI instructions (parallel) 

10-1 54 to 10-155 
OR3 instruction 1 0-1 52 to 1 0-1 53 
ordering information B-7 to B-1 0 
ORing of the ready signals 1 2-9 to 1 2-1 0 

output driver circuitry current 

requirement D-9 to D-1 7 

capacitive load dependence D-1 6 to D-1 8 

data dependency D-1 4 to D-1 6 

expansion bus D-1 3 to D-1 4 

primary bus D-1 0 to D-1 2 
output value formats 1 0-1 0 
overflow 4-15,4-22 




parallel ABSF and STF instructions 1 0-23 to 1 0-24 
parallel ABSI and STI instructions 1 0-27 to 1 0-28 
parallel ADDF3 and MPYF3 instructions 
10-1 19 to 10-121 

parallel ADDF3 and STF instructions 

10-35 to 10-36 
parallel ADDI3 and MPYI3 instructions 

10-1 30 to 10-132 

parallel ADDI3 and STI instructions 1 0-40 to 1 0-41 
parallel addressing modes 2-16, 5-21 to 5-22 
parallel AND3 and STI instructions 1 0-45 to 1 0-46 
parallel ASH3 and STI instructions 1 0-54 to 1 0-55 

parallel bus 12-19 

See also expansion bus interface 

parallel FIX and STI instructions 1 0-77 to 1 0-78 
parallel FLOAT and STF instructions 

10-80 to 10-81 
parallel instruction set summary 2-23 to 2-24 
parallel instructions advantages 1 1 -1 32 
parallel LDF and LDF instructions 10-91 to 10-92 
parallel LDF and STF instructions 1 0-93 to 1 0-94 
parallel LDI and LDI instructions 1 0-1 00 to 1 0-1 01 
parallel LDI and STI instructions 1 0-1 02 to 1 0-1 03 

parallel LSH3 and STI instructions 
10-112 to 10-114 



parallel MPYF3 and ADDF3 instructions 

10-119 to 10-121 
parallel MPYF3 and STF instructions 

10-1 22 to 10-123 
parallel MPYF3 and SUBF3 instructions 

10-1 24 to 10-126 
parallel MPYI3 and ADDI3 instructions 

10-1 30 to 10-132 
parallel MPYI3 and STI instructions 

10-1 33 to 10-134 
parallel MPYI3 and SUBI3 instructions 

10-135 to 10-137 
parallel multiplies and adds 9-29 
parallel NEGF and STF instructions 

10-1 40 to 10-141 
parallel NEGI and STI instructions 

10-143 to 10-144 
parallel NOT and STI instructions 

10-1 49 to 10-150 
parallel operations instructions 1 0-7 to 1 0-8 
parallel OR3 and STI instructions 

10-1 54 to 10-1 55 
parallel STF and ABSF instructions 1 0-23 to 1 0-24 
parallel STF and ADDF3 instructions 

10-35 to 10-36 
parallel STF and FLOAT instructions 

10-80 to 10-81 
parallel STF and LDF instructions 1 0-93 to 1 0-94 
parallel STF and MPYF3 instructions 

10-1 22 to 10-123 
parallel STF and NEGF instructions 

10-1 40 to 10-141 
parallel STF and STF instructions 

10-1 76 to 10-177 
parallel STF and SUBF3 instructions 

10-1 90 to 10-191 
parallel STI and ABSI instructions 1 0-27 to 1 0-28 
parallel STI and ADDI3 instructions 1 0-40 to 1 0-41 
parallel STI and AND3 instructions 1 0-45 to 1 0-46 
parallel STI and ASH3 instructions 1 0-54 to 1 0-55 
parallel STI and FIX instructions 1 0-77 to 1 0-78 
parallel STI and LDI instructions 1 0-1 02 to 1 0-1 03 
parallel STI and LSH3 instructions 

10-112 to 10-114 
parallel STI and MPYI3 instructions 

10-1 33 to 10-134 
parallel STI and NEGI instructions 

10-1 43 to 10-144 
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parallel STI and NOT instructions 

10-1 49 to 10-150 
parallel STI and OR3 instructions 

10-1 54 to 10-155 
parallel STI and STI instructions 1 0-1 80 to 1 0-1 81 
parallel STI and SUBI3 instructions 

10-195 to 10-196 

parallel STI and XOR3 instructions 
10-209 to 10-210 

parallel SUBF3 and MPYF3 instructions 

10-1 24 to 10-126 
parallel SUBF3 and STF instructions 

10-190 to 10-191 
parallel SUBI3 and MPYI3 instructions 

10-135 to 10-137 
parallel SUBI3 and STI instructions 

10-195 to 10-196 
parallel XOR3 and STI instructions 

10-209 to 10-210 

part numbers B-7 to B-1 0 

breakdown of numbers B-9 to B-1 0 
device suffixes B-9 to B-1 0 
prefix designators B-8 to B-9 

part ordering B-1 to B-1 0 

PC-relative addressing 5-1 7 to 5-1 8 

period register (timer) 8-2, 8-8 

peripheral bus 2-27 

general architecture 2-27 
map 3-20 
peripherals on 

DMA controller 8-43 to 8-64 

serial port 2-28, 8- 13 to 8-42 

timers 2-28, 8-2 
register diagram 2-27 

peripheral modules, block diagram 2-27 

peripherals 2-27, 8-1 to 8-64 
DMA controller 8-43 to 8-64 

CPU/DMA interrupt enable register 

8-47 to 8-49 
destination- and source-address registers 
8-47 

global-control register 8-47 
hints for programming 8-57 to 8-58 
initialization/reconfiguration 8-57 
interrupts 8-56 

memory transfer operation 8-49 to 8-53 
programming examples 8-58 to 8-64 



peripherals, DMA controller (continued) 
synchronization of DMA channels 

8-54 to 8-56 
transfer-counter register 8-47 
serial ports 8-1 3 to 8-42 
data-transmit register 8-23 
data-receive register 8-24 
FSR/DR/CLKR port control register 8-20 
FSX/DX/CLKX port control register 

8-18 to 8-19 
functional operation 8-30 to 8-36 
global-control register 8- 15 to 8- 18 
initialization/reconfiguration 8-36 
interrupt sources 8-29 
operation configurations 8-24 to 8-26 
receive/transmit timer control register 

8-21 to 8-22 
receive/transmit timer counter register 8-22 
receive/transmit timer period register 8-23 
timing 8-26 to 8-29 
TMS320C3x interface examples 

8-36 to 8-46 
timers 8-2 to 8-12 

global-control register 8-3 to 8-8 
initialization/reconfiguration 8- 12 to 8- 15 
interrupts 8-11 
operation modes 8-10 to 8-11 
period and counter registers 8-8 
pulse generation 8-8 to 8-9 

pin 

assignments 13-6, 13-7 
states at reset 6-19 

pinout and pin assignments 1 3-2 to 1 3-1 5 
PGA 13-2 to 13-7 
PQFP 

'C30 13-8 to 13-11 

'C31 13-12 to 13-15 

pipeline 

conflicts 9-4 to 9-17 

avoiding 11-132 

delayed branches 9-6 

registers 9-7 to 9-9 

standard branches 9-4 to 9-6 
memory accesses clocking 9-23 to 9-30 
memory conflicts 9-1 0 to 9-1 7 

execute only 9-13 to 9-15 

hold everything 9-15 to 9-17 

program fetch incomplete 9- 12 

program wait 9-10 to 9- 13 

resolving 9-21 to 9-22 
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pipeline (continued) 
operation 9-1 to 9-30 

clocking of memory accesses 9-23 to 9-30 

data loads and stores 9-24 to 9-30 

program fetches 9-23 

branch conflicts 9-4 to 9-6 

memory conflicts 9-1 0 to 9-23 

register conflicts 9-7 to 9-9 
resolving memory conflicts 9-21 to 9-22 
resolving register conflicts 9-1 8 to 9-20 
structure 9-2 to 9-3 

pod interface, emulator 12-40 

POP floating-point instruction 1 0-1 57 

POP integer instruction 1 0-1 56 

POPF instruction 1 0-1 57 

power dissipation D-1 to D-32 
algorithm partitioning D-4 
characteristics D-2 to D-4 
dependencies D-2 to D-3 
FFT assembly code D-30 to D-32 
photo of IDD for FFT D-29 
power requirements D-2 
power supply current requirements D-2 
summary D-28 

test setup description D-4 to D-5 

power supply current requirements D-2 

PQFP reflow soldering precautions C-7 to C-8 

prefix designators B-8 to B-9 

primary bus 7-2 

See also external buses 
bus cycles 7-6 to 7-10 
control register 7-3 to 7-4 
functional timing of operations 7-6 
programmable bank switching 7-31 
programmable wait states 7-28 to 7-29 
ready generation, segmentation of address 
space 12-11 

primary bus interface 2-26, 1 2-4 to 1 2-1 8 
bank switching techniques 1 2-1 3 to 1 2-1 9 
ready generation 1 2-9 to 1 2-1 3 
ANDing of the ready signals 12-10 
example circuit 12-13 to 12-46 
external ready generation 12-10 to 12- 1 1 
ORing of the ready signals 12-9 to 12-10 
ready control logic 12- 11 to 12- 12 
zero-wait-state to static-RAMs 1 2-4 to 1 2-8 

processor initialization 1 1 -2 to 1 1 -5 



program 

buses 2-22 
counter (PC) 2-22,3-11 
fetches 9-23 
flow 6-1 

program control 11-6 

computed GOTOs 1 1 -22 to 1 1 -23 
delayed branches 11-17 
instructions 10-5 

interrupt service routines 1 1 -9 to 1 1 -1 6 
context switching 11-11 to 11-16 
example 11-16 
priority 11-16 

repeat modes 1 1 -1 8 to 1 1 -21 
block repeat 11 -18 to 11-20 
single-instruction repeat 11-20 to 11-26 

software stack 1 1 -8 to 1 1 -9 

subroutines 1 1 -6 to 1 1 -8 

program fetch incomplete 9-1 2 

program flow control 6-1 to 6-38 
calls, traps, and returns 6-1 0 to 6-1 1 
delayed branches 6-8 to 6-9 
interlocked operations 6-1 2 to 6-1 7 
interrupts 6-23 to 6-35 

control bits 6-26 to 6-27 

CPU interrupt latency 6-30 

CPU/DMA interaction 6-30 

prioritization 6-25 to 6-26 

prioritization and control 6-34 to 6-36 

processing 6-27 to 6-30 

TMS320C30 considerations 6-32 to 6-34 

TMS320C3X considerations 6-31 to 6-32 

vector table 6-23 to 6-25 
repeat modes 6-2 to 6-7 

nested block repeats 6-7 to 6-23 

RC register value after repeat mode 
6-6 to 6-7 

repeat-mode control bits 6-3 

repeat-mode operation 6-3 to 6-4 

restrictions 6-6 

RPTB instruction 6-4 to 6-5 

RPTS instruction 6-5 
reset operation 6-1 8 to 6-22 
TMS320LC31 power management 

mode 6-36 to 6-38 

IDLE2 6-36 to 6-37 

LOPOWER 6-38 

program wait 9-1 0 to 9-1 3 
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programmable 

bank switching 7-30 to 7-32 

wait states 7-28 to 7-29 
programming tips 1 1 -1 31 to 1 1 -1 34 

C-callable routines 11-131 

hints for assembly coding 1 1 -1 31 to 1 1 -1 32 

low-power mode wakeup example 
11-133 to 11-134 
pulse mode 

timer interrupt 8-11 

timer pulse generator 8-8 to 8-9 
PUSH floating-point instruction 1 0-1 59 
PUSH integer instruction 1 0-1 58 
PUSHF instruction 10-159 



Q 



quality C-1 to C-8 
queue (stacks) 5-31 , 5-33 



R 



RAM. See memory 

RC register value 6-6 to 6-7 

read unit 9-2 

ready control logic 1 2-1 1 to 1 2-1 2 

ready generation 1 2-9 to 1 2-1 3 
ANDing of the ready signals 1 2-1 0 
example circuit 1 2-1 3 to 1 2-46 
external ready generation 1 2-1 0 to 1 2-1 1 
functions 12-11 

ORing of the ready signals 1 2-9 to 1 2-1 0 
ready control logic 1 2-1 1 to 1 2-1 2 

receive shift register (RSR) 8-24 

receive/transmit timer 

control register (serial port) 8-21 to 8-22 
counter register (serial port) 8-22 
period register (serial port) 8-23 

reflow soldering precautions C-7 to C-8 

register addressing 5-3 

register conflicts 9-7 to 9-9 

register file, CPU 2-7 

registers 

auxiliary (AR7-AR0) 3-3 
block size (BK) 2-9, 3-4, 5-24 
buses 2-22 



registers (continued) 

conflicts (resolving) 9-1 8 to 9-20 
counter (timer) 8-8 
CPU interrupt flag (IF) 3-9 
CPU/DMA interrupt-enable (IE) 3-7, 

8-47 to 8-49 
data-page pointer (DP) 3-4 
destination, extended-precision registers 

(RO-R7) 6-8 
destination register (R7-R0) 

condition flags 10-20 
DMA 

destination and source address 8-47 
global-control register 8-47 
transfer-counter register 8-47 

extended precision (R0-R7) 2-8, 3-3 

FSR/DR/CLKR serial port control 8-20 

FSX/DX/CLKX serial port control 8-1 8 

functional groups 9-7 

I/O flag (IOF) 2-9,3-10 

index (IR0, IR1) 2-9,3-4 

interrupt enable (IE) 2-9 

interrupt flag (IF) 2-9, 6-33 

maximum use 11-132 

memory-mapped peripheral 3-20 

period (timer) 8-8 

program counter (PC) 2-1 0, 2-22, 3-1 1 

receive/transmit timer control 8-21 

repeat 

count (RC) 2-10 
count address (RC) 6-2 
end address (RE) 2-10, 6-2 
start address (RS) 2- 10, 6-2 

repeat mode operation 6-3 to 6-4 

reserved bits 3-12 

serial port 8-13 to 8-42 

serial port global-control 8-1 5 to 8-1 8 
bits summary 8- 15 to 8-18 

status (ST) 3-4 

status register (ST) 2-9, 10-11 

system stack pointer (SP) 2-9, 3-4, 5-31 

timer global-control 8-3 

reliability C-1 to C-8 
stress testing C-2 to C-6 

repeat 

count register (RC) 3-11, 6-2 
end address register (RE) 3-1 1 , 6-2 
mode 6-2 to 6-7, 11 -18 to 11-21 
block repeat 11-18 to 11-20 
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repeat, mode (continued) 
control bits 6-3 

maximum number of repeats 6-3 
nested block repeats 6-7 
operation 6-3 to 6-4 
RC register value 6-6 to 6-7 
restrictions 6-6 
RPTB instruction 6-4 to 6-5 
RPTS instruction 6-5 
single-instruction repeat 11-20 to 11-26 
start address register (RS) 3-11 , 6-2 
repeat block instruction 1 0-1 70 
reserved area, unpredictable results 2-13 
reserved memory locations 

TMS320C31.2-31 
reset 3-17 

operation 5-1 8 to 6-22 
pin states 6-19 
vectors 3-1 8, 3-1 9, 6-35 
RESET signal, generation 1 2-29 to 1 2-31 
resolving register conflicts 9-1 8 to 9-20 
restore clock to regular speed instruction 10-115 
RETIconof instruction 6-1 0, 1 0-1 60 to 1 0-1 61 
RETScond instruction 6-1 0, 1 0-1 62 
return from interrupt conditionally instruction 
10-160 

return from subroutine 6-1 0 
return from subroutine conditionally instruction 
10-162 

returns 6-10 to 6-11 

RINT0, RINT1 signals 3-18, 3-19, 6-24 

RND instruction 1 0-1 63 to 1 0-1 64 

ROL instruction 1 0-1 65 

ROLC instruction 1 0-1 66 to 1 0-1 67 

ROM. See memory 

ROR instruction 1 0-1 68 

RORC instruction 1 0-1 69 

rotate left instruction 1 0-1 65 

rotate left through carry instruction 1 0-1 66 

rotate right instruction 1 0-1 68 

rotate right through carry instruction 1 0-1 69 

round floating-point instruction 1 0-1 63 

rounding of floating-point value 4-20 to 4-21 

RPTB instruction 6-4 to 6-5, 1 0-1 70 

RPTS instruction 6-5, 1 0-1 71 to 1 0-1 72 



S 



scan paths, TBC emulation connections for 'C3x 
12-46 

segment start address (SSA) 3-21 
segmentation of address space 1 2-1 1 
semaphores 6-15 
seminars B-6 
serial port 8-1 3 to 8-42 
clock 8-13,8-27 

timer 8-37 

timing 8-26 to 8-29 
clock configurations 8-24 to 8-26 
continuous transmit and receive mode 8-28 
CPU transfer with transmit polling 8-38 to 8-39 
data-receive register 8-24 
data-transmit register 8-23 
fixed date-rate timing 8-30 

burst mode 8-30 

continuous mode 8-30 
frame sync 8-32, 8-33 
functional operation 8-30 to 8-36 
global-control register 8-1 3, 8-1 5 to 8-1 8 

bits summary 8-15 to 8-18 
handshake mode 8-1 6, 8-28 to 8-30, 8-37, 8-38 

direct connect 8-29 
initialization reconfiguration 8-36 to 8-42 
interface 1 2-32 to 1 2-35 

handshake mode example 8-37 to 8-38 

serial A/C interface example 8-40 

serial A/D and DIA interface example 
8-40 to 8-46 
interrupt sources 8-29 

receive timer 8-29 

receiver 8-29 

transmit timer 8-29 

transmitter 8-29 
operation configurations 8-24 to 8-26 
port control register 

FSR/DR/CLKR 8-20 

FSR/DR/CLKR bits summary 8-20 

FSX/DX/CLKX 8-18 to 8-19 

FSX/DX/CLKX bits summary 8-19 
receiveAransmit timer 

control register 8-21 to 8-22 

counter register 8-22 

period register 8-23 
registers 8-13, 8-42 
timing 8-26 to 8-29 
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serial-port loading 3-33 

servo control/disk drive applications F-1 4 to F-1 6 
servo control-related devices F-1 6 
short-immediate addressing 5-1 6 to 5-1 7 
SIGI instruction 10-173 
signal 

descriptions 1 3-1 6 to 1 3-24 

'C30 13-1 6 to 13-21 

'C31 13-22 to 13-29 
transition levels 13-29 

TTL-Ievel inputs 13-29 to 13-30 

TTL-level outputs 13-29 

signal buffering for emulator connections 1 2-41 

signal descriptions 1 3-1 , 1 3-1 6 to 1 3-24 
pinout and pin assignments 1 3-2 to 1 3-1 5 

signal, interlocked instruction 1 0-1 73 

signals 

12-pin header 12-39 
buffered 12-39,12-43 
buffering for emulator connections 

12-41 to 12-43 
no buffering 12-41 
timing 12-40 to 12-41 

signed-precision, unsigned integer format 4-3 

simulator B-3 

single-instruction repeat 1 1 -20 to 1 1 -21 
single-precision 

floating-point format 4-6 

integer format 4-2 

1 6-bit-wide configured memory 3-32 

software applications 1 1 -1 to 1 1 -34 

application-oriented operations 1 1 -53 to 1 1 -67 

adaptive filters 1 1 -67 

companding 11-53 to 11-57 

fast Fourier transforms (FFT) 
11-73 to 11-125 

FIR filters 11-58 to 11-60 

IIR filters 11 -60 to 11-66 

lattice filters 11-125 to 11-131 

matrix-vector multiplication 11-70 to 11-73 
logical and arithmetic operations 1 1 -23 to 1 1 -34 

bit manipulation 11-23 to 11-24 

bit-reversed addressing 11-25 to 11-26 

block moves 11-25 

extended-precision arithmetic 11-38 to 11-41 
floating-point format conversion 
11 -42 to 11-53 



software applications, logical and arithmetic 
operations (continued) 

integer and floating-point division 
11 -26 to 11-33 

square root 11-34 
processor initialization 1 1 -2 
program control 1 1 -6 to 1 1 -22 

computed GOTOs 11-22 to 11-23 

delayed branches 11-17 

interrupt service routines 11 -9 to 11-16 

repeat modes 11-18 to 11-21 

software stack 11 -8 to 11-9 

subroutines 11-6 to 11-8 
programming tips 1 1 -1 31 to 1 1 -1 34 

C-callable routines 11-131 

hints for assembly coding 11-131 to 11-132 

low-power-mode wakeup example 
11-133 to 11-134 

software control 6-1 

software development tools B-2 to B-6 
bulletin board service (BBS) B-5 to B-6 
code generation tools B-2 

assembler/linker B-2 

C compiler B-2 

compiler B-2 

linker B-2 
digital filter design package B-2 
documentation B-5 
hotline B-5 
literature B-5 
seminars B-6 

system integration and debug tools B-3 to B-4 
debugger B-3 

emulation porting kit (EPK) B-4 to B-5 
emulator B-3 

evaluation module (EVM) B-3 
simulator B-3 
XDS510 emulator B-3 

technical training organization (TTO) work- 
shop B-6 

third parties B-4 

workshops B-6 

software interrupt instruction 1 0-200 

software stack 1 1 -8 to 1 1 -9 

soldering precautions 0-7 to C-8 

speech 

encoding F-3 
memories F-1 2 

synthesis applications F-11 to F-1 3 
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square root 11-34 

stack, software 1 1 -8 to 1 1 -9 

pointer (SP) register 3-4, 5-31 , 1 1 -8 to 1 1 -9 

stack management 5-31 to 5-34 

stack queues 5-33 

stacks 5-32 to 5-33 
growth 5-32 

implementation of high-to-low 5-32 

implementation of low-to-high 5-33 

standard branches 6-8 

status register (ST) 3-4, 1 0-1 1 

bits summary 3-6 

CPU register file 3-5 

global interrupt enable (GIE) bit 

'C30 interrupt considerations 6-32 
'C3x interrupt considerations 6-3 1 

STF and ABSF instructions (parallel) 
10-23 to 10-24 

STF and ADDF3 instructions (parallel) 
10-35 to 10-36 

STF and FLOAT instructions (parallel) 
10-80 to 10-81 

STF and LDF instructions (parallel) 1 0-93 to 1 0-94 

STF and MPYF3 instructions (parallel) 
10-1 22 to 10-123 

STF and NEGF instructions (parallel) 
10-1 40 to 10-141 

STF and STF instructions (parallel) 
10-1 76 to 10-177 

STF and SUBF3 instructions (parallel) 

10-1 90 to 10-191 
STF instruction 1 0-1 74 
STFI instruction 10-175 

STI and ABSI instructions (parallel) 1 0-27 to 1 0-28 

STI and ADDI3 instructions (parallel) 
10-40 to 10-41 

STI and AND3 instructions (parallel) 
10-45 to 10-46 

STI and ASH3 instructions (parallel) 
10-54 to 10-55 

STI and FIX instructions (parallel) 1 0-77 to 1 0-78 

STI and LDI instructions (parallel) 
10-102 to 10-103 

STI and LSH3 instructions (parallel) 
10-112 to 10-114 



STI and MPYI3 instructions (parallel) 

10-1 33 to 10-134 
STI and NEQI instructions (parallel) 

10-1 43 to 10-144 
STI and NOT instructions (parallel) 

10-1 49 to 10-150 
STI and OR3 instructions (parallel) 

10-1 54 to 10-155 
STI and STI instructions (parallel) 

10-1 80 to 10-181 
STI and SUBI3 instructions (parallel) 

10-1 95 to 10-196 
STI and XOR3 instructions (parallel) 

10-209 to 10-210 
STI instruction 10-178 
STII instruction 10-179 
store floating-point instruction 1 0-1 74 
store floating-point, interlocked instruction 1 0-1 75 
store integer instruction 1 0-1 78 
store integer, interlocked instruction 1 0-1 79 
STRB signal 7-2,7-6 
stress testing C-2 to C-6 
style (manual) viii 
SUBB instruction 1 0-1 82 
SUBB3 instruction 1 0-1 83 to 1 0-1 84 
SUBC instruction 1 0-1 85 to 1 0-1 86 
SUBF instruction 1 0-1 87 
SUBF3 and MPYF3 instructions (parallel) 

10-1 24 to 10-126 
SUBF3 and STF instructions (parallel) 

10-1 90 to 10-191 
SUBF3 instruction 1 0-1 88 to 1 0-1 89 
SUBI instruction 10-192 
SUBI3 and MPYI3 instructions (parallel) 

10-1 35 to 10-137 
SUBI3 and STI instructions (parallel) 

10-1 95 to 10-196 
SUBI3 instruction 1 0-1 93 to 1 0-1 94 
SUBRB instruction 1 0-1 97 
SUBRF instruction 1 0-1 98 
SUBRI instruction 1 0-1 99 
subroutines 

computed GOTO 1 1 -22 

context switching 1 1 -1 1 to 1 1 -1 5 

context restore for 'C3x 11-14 to 11-16 
context save for 'C3x 11-12 to 11-13 
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subroutines (continued) 

interrupt priority 1 1 -1 6 to 1 1 -1 8 

program control 1 1 -6 to 1 1 -8 

runtime select 1 1 -20 to 1 1 -21 
subtract example 1 1 -39 
subtract floating-point instruction 1 0-1 87 

3-operand instruction 1 0-1 88 
subtract integer conditionally instruction 1 0-1 85 
subtract integer instruction 1 0-1 92 

3-operand instruction 1 0-1 93 

subtract integer with borrow instruction 1 0-1 82 
3-operand instruction 1 0-1 83 

subtract reverse floating-point instruction 1 0-1 98 

subtract reverse integer instruction 1 0-1 99 

subtract reverse integer with borrow instruction 
10-197 

supply current calculations D-26 to D-27 

average D-27 

data output D-26 to D-27 

experimental results D-27 

processing D-26 
SWI instruction 10-200 
symbols (used in manual) viii 
symbols and abbreviations 1 0-1 4 to 1 0-1 5 
synchronize two processors example 6-1 7 
syntaxes, assembler 1 0-1 6 to 1 0-1 8 
system 

control functions 1 2-27 to 1 2-31 

clock oscillator circuitry 12-27 to 12-29 
reset signal generation 12-29 to 12-3 1 

integration 2-32 
system configuration 

categories of interfaces 1 2-2 

options overview 1 2-2 to 1 2-3 

typical system block diagram 1 2-3 to 1 2-4 

system management 5-31 to 5-34 
system stack pointer 5-31 




target, system, connection 1 2-39 to 1 2-46 

target cable 12-39,12-43 

target system, connection to emulator 

12-41 to 12-43 
technical assistance x 



technical training organization (TTO) workshop 
B-6 

telecommunications applications F-5 to F-1 0 

telecommunications-related devices F-7 

test bit fields instruction 1 0-203 
3-operand instruction 1 0-204 

test bus controller 1 2-45 
test load circuit 13-28 
test setup description D-4 to D-5 
third parties B-4 

32-bit-wide configured memory 3-32 
three-operand addressing modes 2-16, 

5-20 to 5-21 
three-operand instructions 1 0-4 

timer 2-28 

control register 8-11 

receive/transmit 8-21 to 8-22 
counter register 8-8 

receive/transmit 8-22 
global-control register 8-3 to 8-8 

bits summary 8-4 to 8-6 
I/O port configurations 8-1 0 
initialization/reconfiguration 8-1 2 to 8-1 5 
interrupts 8-11 
operation modes 8-1 0 to 8-1 1 
output generation examples 8-9 
period register 8-2, 8-8 

receive/transmit 8-23 
pulse generation 8-8 to 8-9 
registers 8-42 
timing figure 8-7 

timers 8-2 to 8-12 
counter 8-2 

timing 

external interface 7-6 to 7-27 

expansion bus I/O cycles 7-11 to 7-32 
primary bus cycles 7-6 to 7-10 
parameters 1 3-30 to 1 3-67 

changing the XFpin from an input to an 

output 13-44 
changing the XFpin from an output to an 

input 13-43 
data rate timing modes 13-55 to 13-60 
general-purpose I/O timing 13-63 to 13-65 

peripheral pin I/O modes 1 3-63 to 1 3-65 

peripheral pin I/O timing 1 3-63 
interrupt acknowledge timing 13-54 
interrupt response timing 13-52 to 13-53 
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timing, parameters (continued) 

loading when the XF pin is configured as an 

output 13-42 
memory read/write timing 13-32 to 13-37 
rese t timing 13-45 to 13-50 
SHZpin timing 13-51 
timer pin timing 13-66 to 13-67 
X2/CLKIN, H1, and H3 13-30 to 13-31 
XFO and XF1 timing when executing LDFI or 

LDII 13-38 to 13-39 
XFO andXFI timing when executing SIGI 

13-41 

XFO and XF1 timing when executing STFI or 
STII 13-40 

TINTO, TINT1 signals 3-1 8, 3-1 9, 6-24 

TLC32046, F-3 

TLC32070, F-16 

TMS320 

DSP evolution 1-3 

family, general description 1 -2 

TMS320C30 

FFT assembly code D-30 to D-32 
memory maps 2-14 
photo of IDD for FFT D-29 
power dissipation D-1 to D-32 
summary D-28 

TMS320C30 and TMS320C31 differences 2-30 
data/program bus differences 2-30 
development considerations 2-31 
effects on the IF and IE interrupt registers 2-31 
reserved memory locations 2-30 
serial-port differences 2-30 
user program/data ROM 2-31 

TMS320C31 

interrupt and trap memory maps 3-34 

memory maps 2-15 

reserved memory locations 2-31 

TMS320C3x block diagram 
architectural 2-3 
functional 1-5 

TMS320C3X DSPs 1-1 to 1-2 

TMS320C3x family, general description 1 -2 

TMS320C3x interfaces 1 2-1 

TMS320C3X 

serial-port interface examples 8-36 to 8-42 



TMS320LC31 power management 

modes 6-36 to 6-38 

IDLE2 6-36 to 6-37 

LOPOWER 6-38 
total supply current calculation D-1 8 to D-25 

average current D-22 

average current versus peak current D-22 

combining D-1 8 to D-1 9 

dependencies D-1 9 to D-20 

design equation D-21 to D-22 

peak current D-22 

thermal management considerations 
D-23 to D-25 
trap conditionally instruction 1 0-201 
trap vectors 3-18,3-19 
TRAPcond instruction 6-1 0, 1 0-201 to 1 0-202 

traps 3-17, 6-10 to 6=11 
interrupt considerations 
'C30 6-32 to 6-34 
'C3x 6-31 

TSTB instruction 1 0-203 

TSTB3 instruction 1 0-204 to 1 0-205 

two-operand instructions 1 0-3 



U 



U-law compression 1 1 -54 
U-law expansion 1 1 -55 
underflow 4-14 

unsigned-integer format 4-3 
short 4-3 

single-precision 4-3 

user state management 5-31 



V 



variable data-rate timing operation 8-34 
burst mode 8-34 
continuous mode 8-35 

vectors 

interrupts 3-17, 6-35 
reset 3-17,6-35 
trap 3-17 

video signal processing F-21 
voice synthesizers F-1 1 
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W 



wait states 

external bus 1 2-9 to 1 2-1 3 
programmable 7-28 to 7-29 
zero 12-4 to 12-8 

workshops B-6 




XDS, target design considerations 1 2-39 to 1 2-46 
connections between emulator and target 

system 12-41 to 12-43 
designing MPSD emulator connector 

12-39 to 12-40 
diagnostic applications 1 2-45 to 1 2-46 



XDS, target design considerations (continued) 
mechanical dimensions of emulator connector 

12-43 to 12-45 
MPSD emulator cable signal timing 
12-40 to 12-41 

XDS510 emulator B-3 

XFO, XF1 signals 2-26 

XINTO, XINT1 signals 3-1 8, 3-1 9, 6-24 

XOR instruction 1 0-206 

XOR3 and STI instructions (parallel) 
10-209 to 10-210 

XOR3 instruction 1 0-207 to 1 0-208 



Z 



zero condition flag 10-11 
zero-logic interconnect of 'C3x 6-1 6 
zero-overhead looping 6-2 
zero-wait-states 1 2-4 to 1 2-8 
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H4S1R7, (514) 335-8392. 
MEXICO: Texas Instruments de Mexico S.A. de 
C.V., Xola 613, Modulo 1-2, Colina del Valle, 
03100 Mexico, D.F., 5-639-9740. 



AUSTRALIA (& NEW ZEALAND): Texas 
Instruments Australia Ltd.. 6-1 0 Talavera Road, 
North Ryde (Sydney), New South Wales, 
Australia 2113, 2-878-9000; 14th Floor, 380 Street, 
Kilda Road, Melbourne, Victoria, Australia 3000, 
3-696-1211. 

BELGIUM: Texas Instruments Belgium S.A./N.V, 

Avenue Jules Bordetlaan 11 , 1140 Brussels, 

Belgium, (02) 242 30 80. 

BRAZIL: Texas Instruments Electronicos do 

Brasil Ltda., Av. Eng. Luiz Carlos Berrini, 1461, 11 

andar, 04571 -903, Sao Paulo, SP, Brazil, 

11-535-5133. 

DENMARK: Texas Instruments A/S, Borupvang 
2D, 2750 Ballerup, Denmark, (44) 68 74 00. 
FINLAND: Texas Instruments OY, Tekniikantie 12, 
02150 Espoo, Finland, (0) 43 54 20 33. 
FRANCE: Texas Instruments France, 8-10 
Avenue Morane-Sauinier, B.P. 67, 78141 Velizy- 
Villacoublay Cedex, France, (1) 30 70 10 01. 
GERMANY: Texas Instruments Deutschland 
GmbH., HaggertystraBe 1 , 85356 Freising, 
Germany, (08161) 80-0; Kirchhorster Strape 2, 
30659 Hannover, Germany, (0511) 90 49 60; 
MaybachstraBe II, 73760 Ostfildern, Germany, 
(0711)34 03 0. 

HONG KONG: Texas Instruments Hong Kong Ltd., 

8th Floor, World Shipping Centre, 7 Canton Road, 

Kowloon, Hong Kong, 737-0338. 

HUNGARY: Texas Instruments Representation, 

Budaorsi u.50, 3rd floor, 1112 Budapest, 

Hungary, (1)269 8310. 

INDIA: Texas Instruments India Private Ltd., 

AL-Aabeeb, 150/1 Infantry Road, Bangalore 

560 001 , India, (91-80) 226-9007. 

IRELAND: Texas Instruments Ireland Ltd., 

7/8 Harcourt Street, Dublin 2, Ireland, 

(01)475 52 33. 

ITALY: Texas Instruments Italia S.pA, Centra 
Direzionale Colleoni, Palazzo Perseo-Via 
Paracelso 12, 20041 Agrate Brianza (Mi), Italy, 
(039) 63 221 ; Via Castello delta Mag liana, 38, 
00148 Roma, Italy (06) 657 26 51. 
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JAPAN: Texas Instruments Japan Ltd., Aoyama 
Fuji Building 3-6-12 Kita-Aoyama Minato-ku, Tokyo, 
Japan 107, 03-498-12111; MS Shibaura 
Building 9F, 4-13-23 Shibaura, Minato-ku, Tokyo, 
Japan 108, 03-769-8700; Nissho-lwai Building 5F, 
2-5-8 Imabashi, Chuou-ku, Osaka, Japan 541 , 
06-204-1881; Dai-ni Toyota Building Nishi-kan 7F, 
4-10-27 Meieki, Nakamura-ku, Nagoya, Japan 450, 
052-583-8691 ; Kanazawa Oyama-cho Daiichi 
Seimei Building 6F, 3-10 Oyama-cho, 
Kanazawa-shi, Ishikawa, Japan 920, 
0762-23-5471 ; Matsumoto Showa Building 6F, 
1-2-11 Fukashi, Matsumoto-shi, Nagano, Japan 
390, 0263-33-1060; Daiichi Olympic Tachikawa 
Building 6F, 1-25-12, Akebono-cho, Tachikawa-shi, 
Tokyo, Japan 190, 0425-27-6760; Yokohama 
Business Park East Tower 1 0F, 134 Goudo-cho, 
Hodogaya-ku, Yokohama-shi, Kanagawa, Japan 
240, 045-338-1220; Nihon Seimei Kyoto Yasaka 
Building 5F, 843-2, Higashi Shiokohji-cho, 
Higashi-iru, Nishinotoh-in, Shiokohii-dori, 
Snimogyo-ku, Kyoto, Japan 600, 075-341-7713; 
Sumitomo Seimei Kumagaya Building 8F, 2-44 
Yayoi, Kumagaya-shi, Saitama, Japan 360, 
0485-22-2440; 4262, Aza Takao, Oaza Kawasaki, 
Hijl-Machi, Hayami-Gun, Oita, Japan 879-15, 
0977-73-1557. 

KOREA: Texas Instruments Korea Ltd., 28th Floor, 

Trade Tower, 159-1, Samsung-Dong, Kangnam-ku 

Seoul, Korea, 2-551-2800. 

MALAYSIA: Texas Instruments, Malaysia, SDN. 

BHD., Lot 36.1 #Box 93, Menara Maybank, 

100 Jalan Tun Perak, 50050 Kuala Lumpur, 

Malaysia, 50-3-230-6001. 

NORWAY: Texas Instruments Norge A/S, P.B. 106, 

Brin Sveien 3, 0513 Oslo 5, Norway, 

(02)26475 70. 

PEOPLE'S REPUBLIC OF CHINA: Texas 
Instruments China Inc., Beijing Representative 
Office, 7-05 CITIC Building, 19 Jianguomenwai 
Dajie, Beijing, China, 500-2255, Ext. 3750. 
PHILIPPINES: Texas Instruments Asia Ltd., 
Philippines Branch, 14th Floor, Ba-Lepanto Building, 
8747 Paseo de Roxas, 1226 Makati, Metro Manila, 
Philippines, 2-81 7-6031. 
PORTUGAL: Texas Instruments Equipamento 
Electronico (Portugal) LDA., Eng. Frederico Ulricho, 
2650 Moreira Da Maia, 4470 Maia, Portugal 
(2)94810 03. 

SINGAPORE (& INDONESIA, THAILAND): Texas 
Instruments Singapore (PTE) Ltd., 
990 Bendemeer Road, Singapore 1233, 
(65)390-7100. 

SPAIN: Texas Instruments Espafta S.A., 
c/Gobelas 43, 28023, Madrid, Spain, (1) 372 80 51; 
Pare Technologic Del Valles, 08290 Cerdanyola, 
Barcelona. Spain, (3) 31 791 80. 
SWEDEN: Texas Instruments International Trade 
Corporation (Sverigefilialen), Box 30, 164 93, 
Isafjordsgatan 7, KTsta, Sweden, (08) 752 58 00. 
SWITZERLAND: Texas Instruments Switzerland 
AG, Riedstrasse 6, CH-8953 Dietikon, Switzerland, 

(01) 744 2811. 

TAIWAN: Texas Instruments Taiwan Limited, 
Taipei Branch, 23th Floor, Sec. 2, Tun Hua S. 
Road, Taipei 106, Taiwan, Republic of China, 

(2) 378-6800. 

UNITED KINGDOM: Texas Instruments Ltd., 
Manton Lane, Bedford, England, MK41 7PA, 
(0234)270111. 
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